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Reasoning  by  Augmenting  a 
Description  Logic  Reasoner  (Phase  1) 


1  Introduction 

This  is  the  final  report  on  the  DARPO  IPTO  project  Reasoning  by  Augmenting  a  Description 
Logic  Reasoner  (Phase  1),  contract  HR0011-05-C-0094,  providing  information  on  the  objectives 
of  the  program  and  how  these  objectives  were  addressed.  The  Reasoning  by  Augmenting  a 
Description  Logic  Reasoner  Project  (Phase  1)  was  designed  to  provide  languages  and  tools  for 
reasoning  about  information  expressed  in  expressive  Description  Logics  or  ontology  languages 
similar  to  the  W3C  OWL  DL  Web  Ontology  Language.  The  results  of  the  project  were  designs 
of  new  Description  Logics  and  new  reasoning  methods  and  optimizations  for  these  Description 
Logics  as  well  as  the  FaCT++  Description  Logic  reasoner. 


2  Task  Objectives  and  Technical  Problems 

The  first  objective  of  the  project  was  the  design  of  new  reasoning  methods  and  optimizations  for 
reasoning  in  expressive  representation  languages  (i.e.,  Description  Logics)  similar  to  the  W3C 
OWL  DL  Web  Ontology  Language  (http://www.w3.org/TR/owl-reD.  including  optimized 
reasoning  methods  for  hybrid  datatype  extensions  of  Description  Logics.  The  project  also  aimed 
to  catalogue  known  relevant  methods  and  optimizations  and  to  analyze  them  for  suitability. 

A  closely  related  objective  was  the  implementation  of  an  optimized  inference  engine  for  the 
expressive  Description  Logic  and  the  hybrid  extensions.  The  inference  engine  was  to 
incorporate  at  least  the  optimizations  in  current  highly  optimized  systems,  such  as  DLP,  FaCT, 
and  RACER,  that  are  applicable  to  expressive  Description  Logics,  as  well  as  new  optimizations 
designed  for  these  languages.  The  performance  of  the  inference  engine  was  to  be  empirically 
analyzed  on  a  variety  of  inputs,  including  existing  ontologies. 

The  third  objective  of  the  project  was  the  design  and  implementation  of  an  interface  to  control 
the  inference  engine  and  examine  its  performance.  This  interface  was  to  allow  the  support  of 
larger  reasoning  tasks,  both  by  tuning  the  inference  engine  for  a  particular  task  and  by  allowing  a 
larger  system  to  inspect  the  inference  engine's  deliberations. 
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3  General  Methods  and  Technical  Results 

An  analysis  of  Description  Logic  constructs  similar  to  those  in  the  W3C  OWL  DL  Web 
Ontology  Language  was  performed.  Several  additions  to  OWL  DL  were  identified  as  useful  for 
the  purposes  of  the  project. 

One  of  these  extensions  involves  the  addition  of  a  form  of  rules  to  OWL  DL.  This  extension  is 
documented  in  the  paper  "OWL  rules:  A  proposal  and  prototype  implementation"  by  Ian 
Horrocks,  Peter  F.  Patel-Schneider,  Sean  Bechhofer,  and  Dmitry  Tsarkov,  Journal  of  Web 
Semantics,  3(l):23-40,  July  2005.  This  paper  is  available  at 

http://www.websemanticsjournal.org/ps/pub/2005-2  and  as  Appendix  A. 

A  second  extension  involves  the  addition  of  datatypes  and  datatype  predicates  to  OWL  DL.  This 
extension  is  documented  in  the  paper  "Introducing  Customised  Datatypes  and  Datatype 
Predicates  into  OWL"  by  Jeff  Z.  Pan  and  Ian  Horrocks,  2005  OWL:  Experiences  and  Directions 
Workshop,  Galway,  Ireland,  November  2005.  This  paper  is  available  at 
http://www.mindswap.org/2005/QWLWorkshop/subl0.pdf  and  as  Appendix  B. 

A  third  extension  involved  the  addition  of  operators  that  allow  the  construction  of  complex 
relationships,  such  as  construction  of  the  uncle  relationship  from  the  parent  and  brother 
relationships.  An  early  version  of  this  extension  is  documented  in  the  paper  "The  Irresistible 
SRIQ"  by  Ian  Horrocks,  Oliver  Kutz,  and  Ulrike  Sattler,  2005  OWL:  Experiences  and  Directions 
Workshop,  Galway,  Ireland,  November  2005.  This  paper  is  available  at 
http://www.mindswap.org/2005/QWLWorkshop/sub20.pdf  and  as  Appendix  C.  The  final 
version  of  this  extension  is  documented  in  the  paper  “The  Even  More  Irresistible  SROIQ”  by  Ian 
Horrocks,  Oliver  Kutz,  and  Ulrike  Sattler,  in  Proceedings  of  the  Tenth  International  Conference 
on  Knowledge  Representation  and  Reasoning,  June  2006.  A  version  of  this  paper  is  attached  as 
Appendix  L. 

A  report  on  the  use  of  Description  Logics  similar  to  OWL  DL  in  support  of  actual  reasoning 
tasks  is  contained  in  the  paper  "Description  Logics  in  Ontology  Applications"  by  Ian  Horrocks, 
Proceedings  of  the  9th  International  Conference  on  Automated  Reasoning  with  Analytic 
Tableaux  and  Related  Methods  (TABLEAUX  2005),  Koblenz,  Germany,  September  2005, 
Lecture  Notes  in  Artificial  Intelligence  number  3702,  pages  2-13,  Springer,  2005.  This  paper  is 
available  at  http://www.cs.man.ac.uk/~hoiTocks/Publications/download/2005/HoiT05b.pdf  and  as 
Appendix  D. 

Portions  of  these  extensions  were  incorporated  into  a  proposal  for  an  extension  to  the  W3C  OWL 
DL  Web  Ontology  Language.  This  extension,  called  OWL  1.1,  is  documented  in  the  report 
"OWL  1.1  Web  Ontology  Language  Syntax"  by  Peter  L.  Patel-Schneider,  Bell  Labs  Research, 
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Lucent  Technologies,  12  January  2006.  This  report  is  available  at  http://www-db.research.bell- 
labs.com/user/pfps/owl/syntax.html  and  as  Appendix  E.  The  overall  OWL  1 . 1  effort  has  a  home 
page  at  http://owll-l.cs.manchester.ac.uk/. 

An  analysis  of  general  techniques  for  optimizing  Description  Logic  reasoning  was  performed. 
Existing  techniques  were  categorized  being  suitable  for  use  in  expressive  Description  Logics  or 
not.  Several  new  techniques  were  devised  and  also  deemed  suitable  for  use  in  expressive 
Description  Logics. 

The  optimizations  are  documented  in  three  papers:  "Optimised  Classification  for  Taxonomic 
Knowledge  Bases"  by  Dmitry  Tsarkov  and  Ian  Horrocks,  in  Proceedings  of  the  2005  Description 
Logic  Workshop  (DL-2005),  Edinburgh,  Scotland,  July  2005  (available  at 
http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-147/39-TsarHorr.pdf  and 

as  Appendix  L),  "Ordering  heuristics  for  description  logic  reasoning"  by  Dmitry  Tsarkov  and  Ian 
Horrocks,  in  Proceedings  of  the  Nineteenth  International  Joint  Conference  on  Artificial 
Intelligence  (IJCAI  2005),  Edinburgh,  Scotland,  August  2005  (available  at 
http://www.cs.man.ac.uk/~horrocks/Publications/download/2005/TsHo05a.pdf  and  as  Appendix 
G),  and  "Optimised  Terminological  Reasoning  for  Expressive  Description  Logics"  by  Dmitry 
Tsarkov,  Ian  Horrocks,  and  Peter  F.  Patel- Schneider,  which  has  been  submitted  to  the  Journal  of 
Automated  Reasoning.  (The  submitted  version  appears  as  Appendix  H.) 

A  new  Description  Logic  system  was  implemented  in  C++.  This  system,  FaCT++,  incorporates 
many  existing  optimizations,  identified  during  the  project,  as  well  as  some  new  optimizations, 
also  identified  during  the  project.  FaCT++  can  reason  over  several  expressive  Description 
Logics,  including  SHOIQ  (and  thus  is  a  reasoner  for  the  OWL  DL). 

An  overall  description  of  FaCT++  can  be  found  in  "System  Description:  FaCT++1.0"  by  Dmitry 
Tsarkov  and  Ian  Horrocks.  This  document  is  enclosed  with  electronic  forms  of  this  report  as 
Appendix  I. 

The  code  and  documentation  for  FaCT++  can  be  downloaded  from 
http : //o wl . man .ac.uk/factplusplus/.  A  copy  of  the  source  code  and  documentation  as  of  21  April 
2006  appears  as  Appendix  M. 

Two  analyses  of  the  performance  of  Description  Logic  systems  on  existing  ontologies  were 
performed,  one  with  previously-existing  systems  and  one  including  the  new  Description  Logic 
system.  The  analyses  are  reported  on  in  two  papers:  "Benchmarking  DL  Reasoners  Using 
Realistic  Ontologies"  by  Zhengxiang  Pan,  in  OWL:  Experiences  and  Directions  workshop, 
Galway,  Ireland,  November  2005  (available  at 

http://www.mindswap.org/2005/QWLWorkshop/sub6.pdf  and  as  Appendix  J)  and  "Automated 
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Benchmarking  of  Description  Logic  Reasoners"  by  Tom  Gardiner,  Ian  Horrocks,  and  Dmitry 
Tsarkov,  in  Proceedings  of  the  2006  Description  Logic  Workshop  (DL-2006),  June  2006 
(enclosed  as  Appendix  K). 

An  analysis  of  the  existing  DIG  1.1  interface  to  Description  Logic  systems  was  performed.  A 
proposal  for  a  new  interface  to  Description  Logic  systems,  called  DIG  2.0,  was  underway  as  of 
the  end  of  the  project.  DIG  2.0  is  the  result  of  efforts  by  an  international  group  of  researchers, 
including  from  this  project.  Information  on  DIG  2.0  can  be  found  at 
http://homepages.cs.manchester.ac.uk/~seanb/dig/. 


4  Important  Findings  and  Conclusions 

The  project  resulted  in  two  different  kinds  of  findings. 

The  first  finding  relates  to  extensions  of  expressive  Description  Logics.  It  was  found  that  a  small 
number  of  extensions  (composition  of  relationships,  inclusions  of  composed  relationships,  and 
punning)  dramatically  improve  the  acceptability  of  the  W3C  OWL  DL  Web  Ontology  Language. 
These  extensions  have  been  incorporated  into  the  OWL  1 . 1  language,  which  is  being 
implemented  by  several  groups  worldwide. 

The  second  findings  relate  to  implementation  of  reasoners  for  expressive  Description  Logics. 
The  FaCT++  Description  Logic  reasoner  constructed  during  the  project  is  competitive  with 
existing  systems  and  adequate  for  a  considerable  number  of  reasoning  tasks,  a  result  that  was 
somewhat  unexpected.  However  no  existing  Description  Logic  reasoner  is  adequate  for  many 
other  reasoning  tasks,  as  expected,  indicating  that  more  research  needs  to  be  done  in  further 
optimization  of  such  reasoners. 


5  Significant  Developments 

The  most  significant  developments  of  the  project  are: 

La  new  analysis  of  optimizations  for  reasoning  in  expressive  Description  Logics,  including 
new  optimizations; 

2. the  FaCT++  Description  Logic  system,  a  new  high-performance  Description  Logic  system 
able  to  reason  over  the  W3C  OWL  DL  Web  Ontology  Language;  and 

3. the  OWL  1.1  extension  to  the  W3C  OWL  DL  Web  Ontology  Language. 
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6  Special  Comments 

The  work  in  this  project  was  greatly  assisted  by  research  interactions  with  the  worldwide 
community  interest  in  Description  Logics  and  ontologies.  In  particular,  the  OWL  1.1  and  DIG 
2.0  efforts  were  the  result  of  collaborations  between  geographically  distributed  researchers.  The 
results  of  such  interactions  are  of  the  highest  importance  in  the  development  of  languages  and 
systems  for  ontology  reasoning.  Only  in  this  way  can  general  consensus  on  methods  be  achieved 
and  this  consensus  is  needed  for  widespread  adoption. 


7  Implications  for  Further  Research 

It  turned  out  that  getting  to  state  of  the  art  in  Description  Logic  reasoners  was  not  hard.  The 
FaCT++  reasoner  developed  for  Phase  1  of  the  project  was  only  supposed  to  be  a  testbed  for 
further  work  on  optimization,  but  it  turned  out  to  be  competitive  with  existing  high-performance 
reasoners.  However,  although  FaCT++  and  other  existing  high-performance  Description  Logic 
reasoners  are  now  suitable  for  many  tasks,  there  are  many  other  tasks  for  which  they  are  not  fast 
enough.  Therefore  more  research  and  implementation  on  optimizations  for  reasoning  in 
Description  Logics  and  ontologies  are  needed. 

During  the  course  of  the  project  it  was  evident  that  the  existing  state  of  the  Semantic  Web  is  in 
flux  and  that  its  overall  design  still  needs  work.  The  inadequate  foundation  of  the  Semantic  Web 
resulted  in  considerable  work  being  done  in  the  project  to  get  around  its  limitations.  These 
problems  are  currently  affecting  the  W3C  Rule  Interchange  Format  working  group.  Research 
needs  to  be  performed  and  consensus  achieved  in  this  important  resource. 

As  stated  above  it  was  important  to  gather  international  groups  to  perform  research  related  to 
some  of  the  goals  of  the  project.  In  this  project  the  groups  (the  DIG  group  and  the  OWL  1.1 
group)  were  ad  hoc  groups  of  researchers  interested  in  research  related  to  the  goals  of  the  project. 
There  are  still  quite  a  number  of  areas  where  further  collaborative  research  related  to  the  goals  of 
the  project  is  needed,  including  future  work  by  the  DIG  and  OWL  1.1  groups  but  also  work  on 
rules  and  foundations  of  the  Semantic  Web.  DARPA  has  played  a  major  role  in  this  arena 
before,  notably  with  the  DAML  project's  development  of  the  DAML+OIL  web  ontology 
language— an  important  precursor  to  the  W3C  OWL  Web  Ontology  Language. 

DARPA  can  and  should  continue  to  play  an  important  role  in  future  work  in  this  vein,  as  it  is  one 
of  the  very  few  institutions  in  the  United  States  that  funds  large,  collaborative  projects.  DARPA 
can  and  should  support  future  collaborative  work  on  languages  for  Semantic  Web  and  for  agent 
communication.  Such  work  is  of  interest  to  DARPA's  customers,  as  evidenced  by  the  upcoming 
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Language  for  Intelligent  Machines  workshop  sponsored  by  the  US  Army  Research  Office. 
Standards  bodies  (e.g.,  W3C,  etc.)  do  not  have  efforts  that  attract  nearly  the  same  caliber  of 
researchers,  but  can  serve  to  finalize  and  disseminate  the  results  of  such  DARPA  projects. 


8  List  of  Abbreviations  and  Acronyms 

DAML  -  DARPA  Agent  Markup  Language 

DARPA  -  Defense  Advanced  Research  Projects  Agency 

DL  -  Description  Logic 

DLP  -  Description  Logic  Processor 

FaCT  -  Fast  Classification  of  Terminologies 

IPTO  -  Information  Processing  Technology  Office 

OIL  -  Ontology  Inference  Language 

OWL  -  W3C  Web  Ontology  Language 

RACER  -  Reasoner  for  Aboxes  and  Concept  Expressions  Renamed 
SRIQ  -  SRIQ  Description  Logic 
SROIQ  -  SROIQ  Description  Logic 
W3C  -  World  Wide  Web  Consortium 
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9  Appendices 

Appendix  A: 

"OWL  rules:  A  proposal  and  prototype  implementation"  by  Ian  Horrocks,  Peter  F.  Patel- 
Schneider,  Sean  Bechhofer,  and  Dmitry  Tsarkov,  Journal  of  Web  Semantics,  3(l):23-40,  July 
2005.  Enclosed  with  the  full  electronic  version  of  this  report  as  the  file  jws-rules.pdf. 
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OWL  Rules:  A  Proposal  and  Prototype 
Implementation 

Ian  Horrocks  Peter  F.  Patel-Schneider  Sean  Bechhofer 
Dmitry  Tsarkov 

February  28,  2005 


Abstract 

Although  the  OWL  Web  Ontology  Language  adds  considerable  expressive 
power  to  the  Semantic  Web  it  does  have  expressive  limitations,  particularly  with 
respect  to  what  can  be  said  about  properties.  We  present  SWRL  (the  Semantic  Web 
Rules  Language),  a  Horn  clause  rules  extension  to  OWL  that  overcomes  many  of 
these  limitations.  SWRL  extends  OWL  in  a  syntactically  and  semantically  coher¬ 
ent  manner:  the  basic  syntax  for  SWRL  rules  is  an  extension  of  the  abstract  syntax 
for  OWL  DL  and  OWL  Lite;  SWRL  rules  are  given  formal  meaning  via  an  exten¬ 
sion  of  the  OWL  DL  model-theoretic  semantics;  SWRL  rules  are  given  an  XML 
syntax  based  on  the  OWL  XML  presentation  syntax;  and  a  mapping  from  SWRL 
rules  to  RDF  graphs  is  given  based  on  the  OWL  RDF/XML  exchange  syntax.  We 
discuss  the  expressive  power  of  SWRL.  showing  that  the  ontology  consistency 
problem  is  undecidable,  provide  several  examples  of  SWRL  usage,  and  discuss  a 
prototype  implementation  of  reasoning  support  for  SWRL. 


1  Introduction 

The  OWL  Web  Ontology  Language  [47]  adds  considerable  expressive  power  to  the 
Semantic  Web.  However,  for  a  variety  of  reasons  (see  http  :  / /lists  .  w3  .  org/ 
Archives/Public/www-webont-wg/  and  [20]),  including  retaining  the  decid¬ 
ability  of  key  inference  problems  in  OWL  DL  and  OWL  Lite,  OWL  has  expressive 
limitations.  These  restrictions  can  be  onerous  in  some  application  domains,  for  exam¬ 
ple  in  describing  web  services,  where  it  may  be  necessary  to  relate  inputs  and  outputs 
of  composite  processes  to  the  inputs  and  outputs  of  their  component  processes  [51], 
or  in  medical  informatics,  where  it  may  be  necessary  to  transfer  characteristics  across 
partitive  properties  [39]. 

Many  of  the  limitations  of  OWL  stem  from  the  fact  that,  while  the  language  in¬ 
cludes  a  relatively  rich  set  of  class  constructors,  the  language  provided  for  talking  about 
properties  is  much  weaker.  In  particular,  there  is  no  composition  constructor,  so  it  is 
impossible  to  capture  relationships  between  a  composite  property  and  another  (possibly 
composite)  property.  The  standard  example  here  is  the  obvious  relationship  between 
the  composition  of  the  “parent”  and  “brother”  properties  and  the  "uncle"  property. 

One  way  to  address  this  problem  would  be  to  extend  OWL  with  a  more  powerful 
language  for  describing  properties.  For  example,  a  decidable  extension  of  the  descrip¬ 
tion  logics  underlying  OWL  DL  to  include  the  use  of  composition  in  subproperty  ax¬ 
ioms  has  already  been  investigated  [22,  23].  In  order  to  maintain  decidability,  however. 
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the  usage  of  the  constructor  is  limited  to  axioms  of  the  form  P  o  Q  C  P,  i.e.,  axioms 
asserting  that  the  composition  of  two  properties  is  a  subproperty  of  one  of  the  com¬ 
posed  properties.  This  means  that  complex  relationships  between  composed  properties 
cannot  be  captured — in  fact  even  the  relatively  simple  "uncle”  example  cannot  not  be 
captured  (because  "uncle”  is  not  one  of  “parent”  or  “brother”). 

An  alternative  way  to  overcome  some  of  the  expressive  restrictions  of  OWL  would 
be  to  extend  it  with  some  form  of  “rules  language”.  In  fact  adding  rules  to  description 
logic  based  knowledge  representation  languages  is  far  from  being  a  new  idea.  Several 
early  description  logic  systems,  e.g..  Classic  [38,  8],  included  a  rule  language  compo¬ 
nent.  In  these  systems,  however,  rules  were  given  a  weaker  semantic  treatment  than 
axioms  asserting  sub-  and  super-class  relationships;  they  were  only  applied  to  indi¬ 
viduals,  and  did  not  affect  class  based  inferences  such  as  the  computation  of  the  class 
hierarchy.  More  recently,  the  CARIN  system  integrated  rules  with  a  description  logic 
in  such  a  way  that  sound  and  complete  reasoning  was  still  possible  [28].  This  could 
only  be  achieved,  however,  by  using  a  rather  weak  description  logic  ( much  weaker 
than  OWL),  and  by  placing  severe  syntactic  restrictions  on  the  occurrence  of  descrip¬ 
tion  logic  terms  in  the  (heads  of)  rules.  Similarly,  the  DLP  language  proposed  in  [14] 
is  based  on  the  intersection  of  a  description  logic  with  horn  clause  rules;  the  result  is 
obviously  a  decidable  language,  but  one  that  is  necessarily  less  expressive  than  either 
the  description  logic  or  rules  language  from  which  it  is  formed. 

In  this  paper  we  show  how  a  simple  form  of  Horn-style  rules  can  be  added  to  the 
OWL  language  in  a  syntactically  and  semantically  coherent  manner,  the  basic  idea 
being  to  add  such  rules  as  a  new  kind  of  axiom  in  OWL  DL.  We  show  (in  Section  3) 
how  the  OWL  abstract  syntax  in  the  OWL  Semantics  and  Abstract  Syntax  document 
[37]  can  be  extended  to  provide  a  formal  syntax  for  these  rules,  and  (in  Section  4)  how 
the  direct  OWL  model-theoretic  semantics  for  OWL  DL  can  be  extended  to  provide  a 
formal  meaning  for  OWL  ontologies  including  rules  written  in  this  abstract  syntax.  We 
will  also  show  (in  Section  5)  how  OWL's  XML  presentation  syntax  can  be  modified  to 
deal  with  the  proposed  rules. 

The  extended  language  was  originally  called  ORL  (the  OWL  Rules  Language),  but 
is  now  much  better  known  as  SWRL  (the  Semantic  Web  Rules  Language),  a  name 
that  was  coined  when  the  Joint  US/EU  ad  hoc  Agent  Markup  Language  Committee1 
developed  a  W3C  members  submission  based  on  ORL.2  Although  SWRL  includes 
some  additional  features  (mainly  related  to  datatypes  and  predicates)  and  has  some 
minor  syntactic  differences,  we  will  refer  to  the  language  described  here  as  SWRL. 

SWRL  is  considerably  more  powerful  than  either  OWL  DL  or  Horn  rules  alone.  We 
will  show  (in  Section  6)  that  the  key  inference  problems  (e.g.,  ontology  consistency) 
for  SWRL  are  undecidable,  and  (in  Section  7)  provide  examples  that  utilise  the  power 
of  the  combined  languages. 

In  Section  8  we  show  how  OWL's  RDF  syntax  can  be  extended  to  deal  with  rules, 
and  in  Sections  9  and  10  we  discuss  how  reasoning  support  for  SWRL  might  be  pro¬ 
vided.  Finally  (in  Section  1 1),  we  summarise  the  main  features  of  the  SWRL  proposal 
and  suggest  some  directions  for  future  work. 

Jhttp : //www . daml . org/ committee/ 

2http : //www . w3 . org/Submission/2004/SUBM-SWRL-20040521/ 
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2  Overview 


The  basic  idea  of  the  proposal  is  to  extend  OWL  DL  with  a  form  of  rules  while  main¬ 
taining  maximum  backwards  compatibility  with  OWL's  existing  syntax  and  semantics. 
To  this  end,  we  add  a  new  kind  of  axiom  to  OWL  DL,  namely  Horn  clause  rules,  ex¬ 
tending  the  OWL  abstract  syntax  and  the  direct  model-theoretic  semantics  for  OWL 
DL  [37]  to  provide  a  formal  semantics  and  syntax  for  OWL  ontologies  including  such 
rules. 

The  proposed  rules  are  of  the  form  of  an  implication  between  an  antecedent  (body  ) 
and  consequent  (head).  The  informal  meaning  of  a  rule  can  be  read  as:  whenever  (and 
however)  the  conditions  specified  in  the  antecedent  hold,  then  the  conditions  specified 
in  the  consequent  must  also  hold. 

Both  the  antecedent  (body)  and  consequent  (head)  of  a  rule  consist  of  zero  or  more 
atoms.  Atoms  can  be  of  the  form  C(x),  P(x,y),  sameAs(x,y)  or  differentFrom(x,y), 
where  C  is  an  OWL  DL  description.  P  is  an  OWL  property,  and  x,y  are  either  variables, 
OWL  individuals  or  OWL  data  values.  Atoms  are  satisfied  in  extended  interpretations 
(to  take  care  of  variables)  in  the  usual  model-theoretic  way,  i.e.,  the  extended  interpre¬ 
tation  maps  the  variables  to  domain  elements  in  a  way  that  satisfies  the  description, 
property,  sameAs,  or  differentFrom,  just  as  in  the  regular  OWL  model  theory. 

Multiple  atoms  in  an  antecedent  are  treated  as  a  conjunction.  An  empty  antecedent 
is  thus  treated  as  trivially  true  (i.e.  satisfied  by  every  interpretation),  so  the  consequent 
must  also  be  satisfied  by  every  interpretation. 

Multiple  atoms  in  a  consequent  are  treated  as  separate  consequences,  i.e.,  they  must 
all  be  satisfied.  In  keeping  with  the  usual  treatment  in  rules,  an  empty  consequent  is 
treated  as  trivially  false  (i.e.,  not  satisfied  by  any  extended  interpretation).  Such  rules 
are  satisfied  if  and  only  if  the  antecedent  is  not  satisfied  by  any  extended  interpreta¬ 
tion.  Note  that  rules  with  multiple  atoms  in  the  consequent  could  easily  be  rewritten 
(by  applying  standard  rules  of  distributivity)  into  multiple  rules  each  with  an  atomic 
consequent. 

It  is  easy  to  see  that  OWL  DL  becomes  undecidable  when  extended  in  this  way  as 
rules  can  be  used  to  simulate  role  value  maps  [46]  and  make  it  easy  to  encode  known 
undecidable  problems  as  a  SWRL  ontology  consistency  problem  (see  Section  6). 

3  Abstract  Syntax 

The  syntax  for  SWRL  in  this  section  abstracts  from  any  exchange  syntax  for  OWL  and 
thus  facilitates  access  to  and  evaluation  of  the  language.  This  syntax  extends  the  ab¬ 
stract  syntax  of  OWL  described  in  the  OWL  Semantics  and  Abstract  Syntax  document 
[37], 

Like  the  OWL  abstract  syntax,  we  will  specify  the  abstract  syntax  for  rules  by 
means  of  a  version  of  Extended  BNF,  very  similar  to  the  Extended  BNF  notation  used 
for  XML  [52],  In  this  notation,  terminals  are  quoted;  non-terminals  are  not  quoted. 
Alternatives  are  either  separated  by  vertical  bars  (  |  )  or  are  given  in  different  produc¬ 
tions.  Components  that  can  occur  at  most  once  are  enclosed  in  square  brackets  ([. . .  ]); 
components  that  can  occur  any  number  of  times  (including  zero)  are  enclosed  in  braces 
({. . .  }).  Whitespace  is  ignored  in  the  productions  given  here. 

Names  in  the  abstract  syntax  are  RDF  URI  references  [27].  These  names  may  be 
abbreviated  into  qualified  names,  using  one  of  the  following  namespace  names: 
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rdf  http: / /www.w3 .org/1999/ 02/22- rdf- syntax- ns# 
rdfs  http: / /www.w3 .org/2000/ 01/ rdf- schema# 
xsd  http: / /www.w3 . org/2001/XMLSchema# 
owl  http : / / www . w3 . org/2002/ 07/ owl# 

The  meaning  of  each  construct  in  the  abstract  syntax  for  rules  is  informally  de¬ 
scribed  when  it  is  introduced.  The  formal  meaning  of  these  constructs  is  given  in 
Section  4  via  an  extension  of  the  OWL  DL  model-theoretic  semantics  [37]. 

3.1  Rules 

From  the  OWL  Semantics  and  Abstract  Syntax  document  [37],  an  OWL  ontology  in 
the  abstract  syntax  contains  a  sequence  of  annotations,  axioms,  and  facts.  Axioms  may 
be  of  various  kinds,  for  example,  subclass  axioms  and  equivalentClass  axioms.  This 
proposal  extends  axioms  to  also  allow  rule  axioms,  by  adding  the  production: 

axiom  ::=  rule 

Thus  a  SWRL  ontology  could  contain  a  mixture  of  rules  and  other  OWL  DL  constructs, 
including  ontology  annotations,  axioms  about  classes  and  properties,  and  facts  about 
OWL  individuals,  as  well  as  the  rules  themselves. 

A  rule  axiom  consists  of  an  antecedent  (body)  and  a  consequent  (head),  each  of 
which  consists  of  a  (possibly  empty)  set  of  atoms.  Just  as  for  class  and  property  ax¬ 
ioms,  rule  axioms  can  also  have  annotations.  These  annotations  can  be  used  for  several 
purposes,  including  giving  a  label  to  the  rule  by  using  the  rdfsdabel  annotation  prop¬ 
erty. 

rule  ::=  "Implies!" {annotation}  antecedent  consequent’)’ 

antecedent  ::=  ’Antecedent)’ {atom}’)’ 

consequent  ::=  ’Consequent(’{atom}’)’ 

Informally,  a  rule  may  be  read  as  meaning  that  if  the  antecedent  holds  (is  “true”), 
then  the  consequent  must  also  hold.  An  empty  antecedent  is  treated  as  trivially  holding 
(true),  and  an  empty  consequent  is  treated  as  trivially  not  holding  (false).  Non-empty 
antecedents  and  consequents  hold  iff  all  of  their  constituent  atoms  hold.  As  mentioned 
above,  rules  with  multiple  consequents  could  easily  be  rewritten  (using  standard  rules 
of  distributivity)  into  multiple  rules  each  with  a  single  atomic  consequent. 

Atoms  in  rules  can  be  of  the  form  C(x),  P(x,y),  Q(x,z),  sameAs(x,y)  or  different- 
From(x,y),  where  C  is  an  OWL  DL  description.  P  is  an  OWL  DL  individual-valued 
Property,  Q  is  an  OWL  DL  data-vahied  Property,  x,y  are  either  variables  or  OWL  in¬ 
dividuals,  and  z  is  either  a  variable  or  an  OWL  data  value.  In  the  context  of  OWL  Lite, 
descriptions  in  atoms  of  the  form  C(x)  may  be  restricted  to  class  names. 

atom  ::=  description  ’(’  i-object  ’)’ 

individualvaluedPropertylD  ’(’  i-object  i-object  ’)’ 
datavaluedPropertylD  ’(’  i-object  d-object  ’)’ 
sameAs  ’(’  i-object  i-object  ’)’ 
differentFrom  ’(’  i-object  i-object  ’)’ 

Informally,  an  atom  C(x)  holds  if  x  is  an  instance  of  the  class  description  C,  an 
atom  P(x,y)  (resp.  Q(x,z))  holds  if  x  is  related  to  y  (z)  by  property  P  (Q),  an  atom 
sameAs(x,y)  holds  if  x  is  interpreted  as  the  same  object  as  y,  and  an  atom  different- 
From(x,y)  holds  if  x  and  y  are  interpreted  as  different  objects. 
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Atoms  may  refer  to  individuals,  data  literals,  individual  variables  or  data  variables. 
Variables  are  treated  as  universally  quantified,  with  their  scope  limited  to  a  given  rule. 
As  usual,  only  variables  that  occur  in  the  antecedent  of  a  rule  may  occur  in  the  conse¬ 
quent  (a  condition  usually  referred  to  as  “safety”). 

i-object  ::=  i-variable  |  individuallD 
d-object  ::=  d- variable  |  dataLiteral 

i-variable  ::=  'I-variable(’  URIreference  ’)’ 
d-variable  ::=  'D-variable(’  URIreference  ’)’ 

3.2  Human  Readable  Syntax 

While  the  abstract  Extended  BNF  syntax  is  consistent  with  the  OWL  specification, 
and  is  useful  for  defining  XML  and  RDF  serialisations,  it  is  rather  verbose  and  not 
particularly  easy  to  read.  In  the  following  we  will,  therefore,  often  use  a  relatively 
informal  “human  readable”  form  similar  to  that  used  in  many  published  works  on  rules. 
In  this  syntax,  a  rule  has  the  form: 

antecedent  — >  consequent, 

where  both  antecedent  and  consequent  are  conjunctions  of  atoms  written  a i  A . . ,  A  an. 
Variables  are  indicated  using  the  standard  convention  of  prefixing  them  with  a  question 
mark  (e.g.,  lx).  Using  this  syntax,  a  rule  asserting  that  the  composition  of  parent  and 
brother  properties  implies  the  uncle  property  would  be  written: 

parent(?a,  lb)  A  brother(?&,  ?c)  — >  uncle(?a,  ?c).  (1) 

If  John  has  Mary  as  a  parent  and  Mary  has  Bill  has  a  brother,  then  this  rule  requires 
that  John  has  Bill  as  an  uncle.  Using  the  abstract  syntax  described  in  Section  3.1,  this 
rule  would  have  been  written  as: 

Implies(Antecedent(parent(I-variable(a)  I-variable(b)) 
brother(I-variable(b)  I-variable(c))) 
Consequent(uncle(I-variable(a)  I-variable(c)))). 


4  Direct  Model-Theoretic 
Semantics 

The  model-theoretic  semantics  for  SWRL  is  a  straightforward  extension  of  the  seman¬ 
tics  for  OWL  DL  given  in  [37].  The  basic  idea  is  that  we  define  bindings — extensions 
of  OWL  interpretations  that  also  map  variables  to  elements  of  the  domain  in  the  usual 
manner.  A  rule  is  satisfied  by  an  interpretation  iff  every  binding  that  satisfies  the  an¬ 
tecedent  also  satisfies  the  consequent.  The  semantic  conditions  relating  to  axioms  and 
ontologies  are  unchanged,  so  an  interpretation  satisfies  an  ontology  iff  it  satisfies  every 
axiom  (including  rules)  and  fact  in  the  ontology. 

4.1  Interpreting  Rules 

Front  the  OWL  Semantics  and  Abstract  Syntax  document  [37]  we  recall  that  an  abstract 
OWL  interpretation  is  a  tuple  of  the  form 

T=  (. R,EC,ER,L,S,LV ), 


5 


where  R  is  a  set  of  resources,  LV  C  R  is  a  set  of  literal  values,  EC  is  a  mapping  from 
classes  and  datatypes  to  subsets  of  R  and  LV  respectively,  ER  is  a  mapping  from 
properties  to  binary  relations  on  R,  L  is  a  mapping  from  typed  literals  to  elements  of 
LV,  and  S'  is  a  mapping  from  individual  names  to  elements  of  EC( owl  :  Thing). 

Given  an  abstract  OWL  interpretation  X,  a  binding  B(I)  is  an  abstract  OWL  inter¬ 
pretation  that  extends  X  such  that  S  maps  i-variables  to  elements  of  EC(ow\  :  Thing) 
and  L  maps  d-variables  to  elements  of  LV  respectively.  An  atom  is  satisfied  by  a  bind¬ 
ing  B  (X)  under  the  conditions  given  in  Table  1 .  where  C  is  an  OWL  DL  description, 
P  is  an  OWL  DL  individual-valued  Property,  Q  is  an  OWL  DL  data-valued  Property, 
x ,  y  are  variables  or  OWL  individuals,  and  z  is  a  variable  or  an  OWL  data  value. 


Atom 

Condition  on  Interpretation 

C(x) 

P(x,y) 

Q{x,z) 
sameAs(a;,  y) 
differentFrom(a;,  y ) 

S{x)  e  EC(C) 

(S(x),S(y))  6  ER(P) 
(S(x),  L(z))  e  ER(Q) 

S(x)  =  S(y) 

S(x)  i=-  S(y) 

Table  1 :  Interpretation  Conditions 


A  binding  B(X)  satisfies  an  antecedent  A  iff  A  is  empty  or  B(X)  satisfies  every 
atom  in  A.  A  binding  B(X)  satisfies  a  consequent  C  iff  C  is  not  empty  and  B(X) 
satisfies  every  atom  in  C.  A  rule  is  satisfied  by  an  interpretation  X  iff  for  every  binding 
B  such  that  B(X)  satisfies  the  antecedent,  B(X)  also  satisfies  the  consequent. 

The  semantic  conditions  relating  to  axioms  and  ontologies  are  unchanged.  In  par¬ 
ticular,  an  interpretation  satisfies  an  ontology  iff  it  satisfies  every  axiom  (including 
rules)  and  fact  in  the  ontology;  an  ontology  is  consistent  iff  it  is  satisfied  by  at  least  one 
interpretation;  an  ontology  O2  is  entailed  by  an  ontology  0 1  iff  every  interpretation 
that  satisfies  0\  also  satisfies  02- 

4.2  Example 

Consider,  for  example,  the  “uncle"  rule  (1)  from  Section  3.2.  Assuming  that  parent, 
brother  and  uncle  are  individualvaluedPropertyIDs,  then  given  an  interpretation  X  = 
(R,  EC,  ER,  L,  S,  LV),  a  binding  B(X)  extends  S  to  map  the  variables  la,  lb,  and  ?c 
to  elements  of  EC(o\n\  :  Thing);  we  will  use  a,  b,  and  c  respectively  to  denote  these 
elements.  The  antecedent  of  the  rule  is  satisfied  by  B{X)  iff  (a,  b)  6  £X?(parent)  and 
(6,  c)  6  iTR(brother).  The  consequent  of  the  rule  is  satisfied  by  BIX)  iff  (a,  c)  6 
iTR(uncle).  Thus  the  rule  is  satisfied  by  X  iff  for  every  binding  B(X)  such  that  (o,  b)  6 
iTR(parent)  and  (b,c)  €  iXR(brother),  then  it  is  also  the  case  that  (a,  c)  6  f?f?.(uncle), 
i.e.: 

Va,  b,c  e  EC{ owl  :  Thing). 

(( a,b )  6  -E-R(parent)  A  (b,  c)  6  iTR(brother))  — >  ( a,c )  6  i?f?(uncle) 


5  XML  Concrete  Syntax 

Many  possible  XML  encodings  could  be  imagined,  but  the  most  obvious  solution  is 
to  extend  the  existing  OWL  Web  Ontology  Language  XML  Presentation  Syntax  [17], 
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which  can  be  straightforwardly  modified  to  deal  with  SWRL.3  This  has  several  advan¬ 
tages: 


•  arbitrary  OWL  classes  (e.g.,  descriptions)  can  be  used  as  predicates  in  rules; 

•  rules  and  ontology  axioms  can  be  freely  mixed; 

•  the  existing  XSLT  stylesheet4  can  easily  be  extended  to  provide  a  mapping  to 
RDF  graphs  that  extends  the  OWL  RDF/XML  exchange  syntax  (see  Section  8). 

In  the  first  place,  the  ontology  root  element  is  extended  so  that  ontologies  can 
include  rule  axioms  and  variable  declarations  as  well  as  OWL  axioms,  import  state¬ 
ments  etc.  We  then  simply  need  to  add  the  relevant  syntax  for  variables  and  rules. 
In  this  paper  we  use  the  unspecified  owlr  namespace  prefix  for  the  newly  intro¬ 
duced  syntax  (the  owlx  namespace  prefix,  which  should  be  treated  as  being  bound  to 
http  :  / /www.  w3  .org/2003/ 05 /owl -xml,  is  used  for  the  existing  OWL  XML 
syntax).  In  practice,  the  owlr  prefix  would  have  to  be  bound  to  some  appropriate 
namespace  name  (e.g.,  the  OWL  namespace  name,  the  OWL  XML  namespace  name, 
or  some  new  namespace  name). 

Variable  declarations  are  statements  about  variables,  indicating  that  the  given  URI 
is  to  be  used  as  a  variable,  and  (optionally)  adding  any  annotations.  For  example: 

<owlr:Variable  owlr:name="x1 "  />, 

states  that  the  URI  xl  (in  the  current  namespace)  is  to  be  treated  as  a  variable. 

Rule  axioms  are  similar  to  OWL  SubClassOf  axioms,  except  they  have  Owlr:Rule 
as  their  element  name.  Like  SubClassOf  and  other  axioms  they  may  include  anno¬ 
tations.  Rule  axioms  have  an  antecedent  (Owlr:antecedent)  component  and  a  conse¬ 
quent  (owlriconsequent)  component.  The  antecedent  and  consequent  of  a  rule  are 
both  lists  of  atoms  and  are  read  as  the  conjunction  of  the  component  atoms.  Atoms  can 
be  formed  from  unary  predicates  (classes),  binary  predicates  (properties),  equalities  or 
inequalities. 

Class  atoms  consist  of  a  description  and  either  an  individual  name  or  a  variable 
name,  where  the  description  in  a  class  atom  may  be  a  class  name,  or  may  be  a  complex 
description  using  boolean  combinations,  restrictions,  etc.  For  example,5 

<owlr:classAtom> 

<owlx:Class  owlx:name="Person"  /> 

<owlr:Variable  owlr:name="x1 "  /> 

</owlr:classAtom> 

is  a  class  atom  using  a  class  name  (#Person),  and 

<owlr:classAtom> 

<owlx:lntersectionOf> 

<owlx:Class  owlx:name="Person"  /> 

<owlx:ObjectRestriction 

owlx:property="  has  Parent  "> 

<owlx:someValuesFrom 

-The  syntax  used  in  the  W3C  Member  Submission  was  changed  slightly  in  order  to  make  it  more  com- 
patible  with  RuleML  (see  http :  /  / www .  ruleml .  org/). 

4http : //www . w3 . org/TR/owl-xmlsyntax/ owlxml2rdf . xsl 

5Note  that  we  use  the  owlx  namespace  prefix  for  the  names  used  in  examples. 
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owlx:class=" Physician"  /> 

</owlx:ObjectRestriction> 

</owlx:lntersectionOf> 

<owlr:Variable  owlr:name="x2"  /> 

</owlr:classAtom> 

is  a  class  atom  using  a  complex  description  representing  Persons  having  at  least  one 
parent  who  is  a  Physician. 

Property  atoms  consist  of  a  property  name  and  two  elements  that  can  be  individual 
names,  variable  names  or  data  values  (as  OWL  does  not  support  complex  property 
descriptions,  a  property  atom  takes  only  a  property  name).  Note  that  in  the  case  where 
the  second  element  is  an  individual  name  the  property  must  be  an  individual-valued 
Property,  and  in  the  case  where  the  second  element  is  a  data  value  the  property  must 
be  a  data-valued  Property.  For  example: 

<owlr:individualPropertyAtom 

owlx:property= "  hasParent "  > 

<owlr:Variable  owlr:name="x1 "  /> 

<owlx:lndividual  owlx:name="John"  /> 
</owlr:individualPropertyAtom> 

is  a  property  atom  using  an  individual-valued  Property  (the  second  element  is  an  indi¬ 
vidual),  and 

<owlr:datavaluedPropertyAtom  owlr:property=  "grade "> 

<owlr:Variable  owlr:name="x1 "  /> 

<owlx:DataValue 

rdf:datatype="&xsd;integer">4</owlx:DataValue> 

</owlr:datavaluedPropertyAtom> 

is  a  property  atom  using  a  data-valued  Property  (the  second  element  is  a  data  value,  in 
this  case  an  integer). 

Finally,  same  (different)  individual  atoms  assert  equality  (inequality)  between  sets 
of  individual  and  variable  names.  Note  that  (in)equalities  can  be  asserted  between 
arbitrary  combinations  of  variable  names  and  individual  names.  For  example: 

<owlr:samelndividualAtom> 

<owlr:Variable  owlr:name="x1 "  /> 

<owlr:Variable  owlr:name="x2"  /> 

<owlx:lndividual  owlx:name="Clinton"  /> 

<owlx:lndividual  owlx:name="Bill_Clinton"  /> 
</owlr:samelndividualAtom> 

asserts  that  the  variables  xl,  x2  and  the  individual  names  Clinton  and  Bill  Clinton  all 
refer  to  the  same  individual. 

5.1  Example 

The  example  rule  from  Section  3.2  can  be  written  in  the  XML  concrete  syntax  for  rules 
as 


<owlx:Rule> 

<owlr:antecedent> 


<owlr:individualPropertyAtom 

owlr:property= "  parent  "> 
<owlr:Variable  owlr:name="a"  /> 
<owlr:Variable  owlr:name="b"  /> 
</owlr:individualPropertyAtom> 
<owlr:individualPropertyAtom 

owlr:property="  brother"  > 
<owlr:Variable  owlr:name="b"  /> 
<owlr:Variable  owlr:name="c"  /> 
</owlr:individualPropertyAtom> 
</owlr:antecedent> 

<owlr:consequent> 

<owlr:individualPropertyAtom 

owlr  :property= "  uncle "  > 
<owlr:Variable  owlr:name="a"  /> 
<owlr:Variable  owlr:name="c"  /> 
</owlr:individualPropertyAtom> 
</owlr:consequent> 

</owlr:Rule> 


6  The  Power  of  Rules 

In  OWL,  the  only  relationship  that  can  be  asserted  between  properties  is  subsump¬ 
tion  between  atomic  property  names,  e.g.,  asserting  that  hasFather  is  a  subPropertyOf 
hasParent.  In  Section  3.2  we  have  already  seen  how  a  rule  can  be  used  to  assert  more 
complex  relationships  between  properties.  While  this  increased  expressive  power  is 
clearly  very  useful,  it  is  easy  to  show  that  it  leads  to  the  undecidability  of  key  inference 
problems,  in  particular  ontology  consistency. 

For  extensions  of  languages  such  as  OWL  DL,  the  undecidability  of  the  consistency 
problem  is  often  proved  by  showing  that  the  extension  makes  it  possible  to  encode  a 
known  undecidable  domino  problem  [4]  as  an  ontology  consistency  problem.  In  partic¬ 
ular,  it  is  well  known  that  such  languages  only  need  the  ability  to  represent  an  infinite 
2-dimensional  grid  in  order  for  consistency  to  become  undecidable  [2,  24].  With  the 
addition  of  rules,  such  an  encoding  is  trivial.  For  example,  given  two  properties  X-SUCC 
and  y-SUCC,  the  rule: 

x-succ(?a,  lb)  A  y-succ(?b,  ?c)  A  y-succ(?a,  Id)  A  x-succ(?d,  ?e) 

— >  sameAs(?c,  ?e), 

along  with  the  assertion  that  every  grid  node  is  related  to  exactly  one  other  node  by 
each  of  x-SUCC  and  y-SUCC,  allows  such  a  grid  to  be  represented.  This  would  be  pos¬ 
sible  even  without  the  use  of  the  sameAs  atom  in  the  consequent — it  would  only  be 
necessary  to  establish  appropriate  relationships  with  a  “diagonal”  property: 

x-succ(?o,  lb)  a  y-succ(?6,  ?c)  — ►  diagonal(?a,  ?c) 
y-succ(?a,  Id)  A  x-succ(?d,  ?e)  —  diagonal(?a,  ?e), 

and  additionally  assert  that  every  grid  node  is  related  to  exactly  one  other  node  by 
diagonal. 
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The  proposed  form  of  OWL  rules  seem  to  go  beyond  basic  Horn  clauses  in  allow¬ 
ing: 

•  conjunctive  consequents; 

•  class  descriptions  as  well  as  class  names  as  predicates  in  class  atoms;  and 

•  equalities  and  inequalities. 

On  closer  examination,  however,  it  becomes  clear  that  most  of  this  is  simply  “syntactic 
sugar”,  and  does  not  add  to  the  power  of  the  language. 

In  the  case  of  conjunctive  consequents,  it  is  easy  to  see  that  these  could  be  elimi¬ 
nated  by  rewriting  using  standard  rules  of  distributivity.  For  example,  the  rule 

A  ->  Ci  A  C2 

is  equivalent  to  —>A  V  (Ci  A  C2)  and,  via  distributivity,  to  {->A  V  Ci)  A  {->A  V  C2),  so 
can  be  rewritten  as  a  semantically  equivalent  pair  of  rules 

A-+  Cx 
A^C2. 

In  the  case  of  class  descriptions,  it  is  easy  to  see  that  a  description  d  can  be  elimi¬ 
nated  from  a  rule  simply  by  adding  an  OWL  axiom  that  introduces  a  new  class  name 
and  asserts  that  it  is  equivalent  to  d,  e.g., 

EquivalentClasses(D  d). 

The  description  can  then  be  replaced  with  the  name,  here  replacing  the  description  d 
with  class  name  D. 

In  the  case  of  equality  atoms,  the  sameAs  property  could  easily  be  substituted  with 
a  "user  defined”  owl  property  called,  for  example,  Eq.  Such  a  property  can  be  given 
the  appropriate  meaning  using  a  rule  of  the  form 

Thing(?x)  — *  Eq(?x,  ?x)  (2) 

and  by  asserting  that  it  is  functional.  It  is  easy  to  see  that  the  interpretation  of  Eq 
corresponds  to  equality  of  elements  in  EC( owl  :  Thing),  i.e., 

Vx,  y  €  EC(aw\  :  Thing). (x,  y)  6  ER(Eq)  <=>  x  =  y, 

and  that  Eq  could  therefore  be  used  instead  of  sameAs  without  changing  the  meaning 
of  the  ontology. 

Proof:  For  the  if  direction,  assume  that  for  some  interpretation  X  there  exists  an  ele¬ 
ment  x  of  EC( owl  :  Thing)  such  that  (x,x)  0  ER( Eq).  Then  a  binding  B(X)  could 
extend  X  so  that  S  maps  ?x  to  x,  and  rule  2  would  not  be  satisfied  by  B(X).  For  the 
only  if  direction,  assume  that  for  some  interpretation  X  there  exist  elements  x,  y  of 
EC(o\n\  :  Thing)  such  that  (x,  y)  6  ER{ Eq)  and  x  y.  From  the  if  direction  we  also 
have  that  (x,  x)  6  ER{ Eq),  so  Eq  would  not  be  functional. 

The  case  of  inequalities  is  slightly  more  complex.  An  owl  property  called,  for 
example,  Neq,  can  be  introduced  and  used  to  capture  some  of  the  meaning  of  the  dif- 
ferentFrom  property  by  adding  a  rule  of  the  form 

Eq(?x,  ?y)  A  Neq(?x,  ?y)  —>  Nothing(?x).  (3) 
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It  is  easy  to  see  that  the  interpretation  of  Neq  is  disjoint  from  the  interpretation  of  Eq, 
i.e., 

Vx, y  6  £’C'(owl  :  Thing). (a:, y)  6  ER.(Neq)  ==>•  i/y, 
and  that  this  leads  to  the  implicit  rule 

Neq(?x,  ly)  — >  differentFrom  (?x,  ly). 

Proof:  Assume  that  for  some  interpretation  X  there  exist  elements  x.  y  of 
EC(o\n\  :  Thing)  such  that  (x,y)  6  -EfR(Neq)  and  (, x,y )  0  _Ef?(differentFrom).  If 
{x,y}  0  _E_R(differentFrom),  then  x  =  y  and  {x,y}  6  ER( Eq).  A  binding  B(X) 
could,  therefore,  extend  X  so  that  S  maps  lx  to  x  and  ly  to  y,  and  rule  3  would  imply 
that  x  6  EC( owl  :  Nothing),  violating  the  semantic  conditions  on  X. 

Rule  3  shows  that  we  could  eliminate  differentFrom  when  it  occurs  in  the  conse¬ 
quent  of  a  rule  simply  by  substituting  Neq.  Neq  does  not,  however,  fully  capture  the 
meaning  of  inequality,  because  there  could  be  pairs  of  elements  in  EC(aw\  :  Thing) 
that  are  in  the  extension  of  neither  Eq  nor  Neq,  i.e.,  differentFrom  does  not  imply  Neq. 
As  a  result,  we  cannot  use  Neq  to  eliminate  occurrences  of  differentFrom  in  the  an¬ 
tecedent  of  a  rule:  in  order  to  do  so  would  require  Neq  to  be  equivalent  to  the  negation 
of  Eq. 


7  Examples  of  SWRL 

We  give  two  further  examples  of  SWRL  that  serve  to  illustrate  some  of  its  utility,  and 
show  how  the  power  of  SWRL  goes  beyond  that  of  either  OWL  DL  or  Horn  rules  alone. 

7.1  Transferring  Characteristics 

The  first  example  is  due  to  Guus  Schreiber,  and  is  based  on  ontologies  used  in  an  image 
annotation  demo  [16]. 

Artist(?x)  A  Style(?y)  A  artistStyle(?x,  ly)  A  creator(?x,  Iz) 

— >  style/period (?z,  ly) 

The  rule  expresses  the  fact  that,  given  knowledge  about  the  Style  of  certain  Artists  (e.g., 
van  Gogh  is  an  Impressionist  painter),  we  can  derive  the  style/period  of  an  art  object 
from  the  value  of  the  creator  of  the  art  object,  where  Style  is  a  term  from  the  Art  and 
Architecture  Thesaurus  (AAT),6  Artist  is  a  class  from  the  Union  List  of  Artist  Names 
(ULAN),7  artistStyle  is  a  property  relating  ULAN  Artists  to  AAT  Styles,  and  both 
creator  and  style/period  are  properties  from  the  Visual  Resources  Association  catalogue 
(VRA),8  with  creator  being  a  subproperty  of  the  Dublin  Core  element  dc:creator.9 

This  rule  would  be  expressed  in  the  XML  concrete  syntax  as  follows  (assuming 
appropriate  entity  declarations): 

<owlr:Rule> 

<owlr:antecedent> 

6http : //www . getty . edu/ re search/ t ools/ vocabulary /aat / 

7http  :  /  /www .  getty .  edu/ re  search/ conduct  ing.re  search/ vocabularies /ulan/ 

8http : / /www . vraweb . org/ 

9http : //dublincore . org/ 


11 


<owlr:classAtom> 

<owlx:Class  owlx:name="&ulan;Artist"  /> 

<owlr:Variable  owlr:name="x"  /> 

</owlr:classAtom> 

<owlr:classAtom> 

<owlx:Class  owlx:name="&aat;Style"  /> 

<owlr:Variable  owlr:name="y"  /> 

</owlr:classAtom> 

<owlr:individualPropertyAtom 

owlr:property="&aatulan;artistStyle"> 

<owlr:Variable  owlr:name="x"  /> 

<owlr:Variable  owlr:name="y"  /> 
</owlr:individualPropertyAtom> 

<owlr:individualPropertyAtom 

owlr:property="&vra;creator"> 

<owlr:Variable  owlr:name="x"  /> 

<owlr:Variable  owlr:name="z"  /> 
</owlr:individualPropertyAtom> 

</owlr:antecedent> 

<owlr:consequent> 

<owlr:individualPropertyAtom 

owlr:property=  "&vra;style/period "  > 

<owlr:Variable  owlr:name="z"  /> 

<owlr:Variable  owlr:name="y"  /> 
</owlr:individualPropertyAtom> 

</owlr:consequent> 

</owlr:Rule> 

The  example  is  interesting  because  it  shows  how  rules  can  be  used  to  “trans¬ 
fer  characteristics"  from  one  class  of  individuals  to  another  via  properties  other  than 
subClassOf — in  this  case,  the  Style  characteristics  of  an  Artist  (if  any)  are  transferred 
(via  the  creator  property)  to  the  objects  that  he/she  creates.  This  idiom  is  much  used  in 
ontologies  describing  complex  physical  systems,  such  as  medical  terminologies,  where 
partonomies  may  be  as  important  as  subsumption  hierarchies,  and  where  characteris¬ 
tics  often  need  to  be  transfered  across  various  partitive  properties  [34,  41,  44],  For 
example,  the  location  of  a  trauma  should  be  transfered  across  the  partOf  property,  so 
that  traumas  located  in  a  partOf  an  anatomical  structure  are  also  located  in  the  structure 
itself  [39],  This  could  be  expressed  using  a  rule  such  as 

Trauma(?x)  A  Location(?y)  A  isLocatedln(?x,  ?y)  A  isPartOf(?j/,  ?z) 

— >  isLocatedln(?x,  ?z) 

A  similar  technique  could  be  used  to  transfer  properties  to  composite  processes  from 
their  component  processes  when  describing  web  services. 

Terminology  languages  designed  specifically  for  medical  terminology  such  as  Grail 
[40]  and  SNOMED-RT  [48]  often  allow  this  kind  of  idiom  to  be  expressed,  but  it  cannot 
be  expressed  in  OWL  (not  even  in  OWL  full).  Thus  this  kind  of  rule  shows  one  way  in 
which  SWRL  goes  beyond  the  expressive  power  of  OWL  DL. 
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7.2  Inferring  the  Existence  of  New  Individuals 

The  second  example  is  due  to  Mike  Dean,  and  illustrates  a  scenario  in  which  we  want 
to  express  the  fact  that  for  every  Airport  there  is  a  map  Point  that  has  the  same  location 
(latitude  and  longitude)  as  the  Airport  and  that  is  an  object  of  “layer”  (a  map  Draw- 
ingLayer).10  Moreover,  this  map  point  has  the  Airport  as  an  underlyingObject  and  has 
the  Airport  name  as  its  Label.  Note  how  the  expressive  power  of  SWRL  allows  “ex- 
istentials”  to  be  expressed  in  the  head  of  a  rule — it  is  asserted  that,  for  every  Airport, 
there  must  exist  such  a  map  point  (using  an  OWL  someValuesFrom  restriction  in  a 
class  atom).  In  this  way  SWRL  goes  beyond  the  expressive  power  of  Horn  rules. 

The  first  part  of  this  example  is  background  knowledge  about  Airports  and  maps 
expressed  in  OWL  DL.  (A  few  liberties  have  been  taken  with  the  OWL  DL  ab¬ 
stract  syntax  here  in  the  interests  of  better  readability.)  In  particular,  it  is  stated  that 
map:location  and  map:object  are  individual-valued  Properties  with  inverse  proper¬ 
ties  map:isLocationOf  and  map:isObjectOf  respectively;  that  latitude  and  longitude 
are  data-valued  Properties;  that  map:Location  is  a  class  whose  instances  have  ex¬ 
actly  one  latitude  and  exactly  one  longitude,  both  being  of  type  xsd:double;  that  layer 
is  an  instance  of  map:DrawingLayer;  that  map  is  an  instance  of  map:Map  whose 
map:name  is  "Airports"  and  whose  map:layer  is  layer;  and  that  airport:GEC 
is  an  instance  of  airport-ont; Airport  whose  name  is  "Spokane  Inti"  and  whose 
location  is  latitude  47.6197  and  longitude  117.5336. 

ObjectProperty  (map:location) 

ObjectProperty  (map:isLocationOf 
inverseOf  (map:location)  ) 

ObjectProperty  (mapiobject) 

ObjectProperty  (map:isObjectOf 
inverseOf  (map:location)  ) 

DatatypeProperty  (latitude) 

DatatypeProperty  (longitude) 

Class  (map:Location  primitive 
intersectionOf  ( 

restriction  (latitude  allValuesFrom  (xsd:double)  ) 
restriction  ( latitude  minCardinality  ( l ) ) 
restriction  (longitude  allValuesFrom  (xsd:double)  ) 
restriction  (longitude  minCardinality  (l)  )  )  ) 

Individual  (layer  type  (map:Drawingl_ayer)  ) 

Individual  (map  type  (map:Map) 
value  (mapmame  "Airports") 
value  (map:layer  layer)) 

Individual  (airport:GEC  type  (airport-ont:Airport) 
value  (name  "Spokane  Inti") 
value  (location  Individual  (value  (latitude  47.6197) 
value  (longitude  117.5336)))) 

10http : //www . daml . org/2003/06/ruletest s/translation- 3 . n3 
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The  first  rule  in  the  example  requires  that  if  a  map:Location  is  the  sameLocation 
as  another  location,  then  it  has  the  same  values  for  latitude  and  longitude. 

map: Location (?map/oc)  A  sameLocation(?Zoc,  Imaploc) A 
latitude(?Zoc,  Hat)  A  longitude(?(oc,  lion) 

— >  latitude(?map(oc,  Hat)  A  latitude(?mapioc,  Hon) 

The  second  rule  requires  that  wherever  an  airport-ont:Airport  is  located,  there  is 
some  map  location  that  is  the  sameLocation  as  the  Airport’s  location,  and  that  is 
the  location  of  a  map:Point  that  is  an  object  of  the  map:DrawingLayer  “layer".  Note 
that  the  head  of  the  rule  is  an  atom  of  the  form  C(?loc),  where  the  class  C  is  an  OWL 
restriction. 

airport-ont:Airport(?cwrport)  A  location (fairpart,  Hoc) A 
latitude(?/oc,  Hat)  a  longitude(?(oc,  Hon) 

— >  restriction(sameLocation 
someValuesFrom( 
intersectionOf(map  :  Location 
restriction(isLocationOf 
someValuesFrom( 
intersectionOf(map  :  Point 
restriction(map  :  isObjectOf 
someValuesFrom  (OneOf  (layer)) )))))))  (Hoc) 

The  third  rule  requires  that  the  map:Point  whose  map:location  is  the 
map:Location  of  an  airport-ont: Airport  has  the  airport  as  a  map:underlyingObject 
and  has  a  map:label  which  is  the  name  of  the  airport. 

airport-ont:Airport(?a«rport)  A  map:location {lair port,  Hoc)  A 

sameLocation(?Zoc,  Imaploc)  A  map : Location  (?pomf,  Imaploc)  A 
airport-ont:name(?airporf,  Iname) 

— ►  map:underlyingObject (1  point, 1  airport)  A 

map:label(?pomt,  ?name) 


8  Mapping  to  RDF  Graphs 

It  is  widely  assumed  that  the  Semantic  Web  will  be  based  on  a  hierarchy  of  (increas¬ 
ingly  expressive)  languages,  with  RDF/XML  providing  the  syntactic  and  semantic 
foundation  (see,  e.g.,  [5]).  In  accordance  with  this  design  philosophy,  the  charter  of 
the  W3C  Web  Ontology  Working  Group  (the  developers  of  the  OWL  language)  explic¬ 
itly  stated  that  "The  language  will  use  the  XML  syntax  and  datatypes  wherever  possible , 
and  will  be  designed  for  maximum  compatibility  with  XML  and  RDF  language  conven¬ 
tions.”.  In  pursuance  of  this  goal,  the  working  group  devoted  a  great  deal  of  effort  to 
developing  an  RDF  based  syntax  for  OWL  that  was  also  consistent  with  the  semantics 
of  RDF  [20].  It  is.  therefore,  worth  considering  how  this  design  might  be  extended  to 
encompass  rules. 

One  rather  serious  problem  is  that,  unlike  OWL,  rules  have  variables,  so  treating 
them  as  a  semantic  extension  of  RDF  is  very  difficult.  It  is,  however,  still  possible 


14 


to  provide  an  RDF  syntax  for  rules — it  is  just  that  the  semantics  of  the  resultant  RDF 
graphs  may  not  be  an  extension  of  the  RDF  Semantics  [15]. 

A  mapping  to  RDF/XML  is  most  easily  created  as  an  extension  to  the  XSLT  trans¬ 
formation  for  the  OWL  XML  Presentation  syntax.11  This  would  introduce  RDF  classes 
for  SWRL  atoms  and  variables,  and  RDF  properties  to  link  atoms  to  their  predicates 
(classes  and  properties)  and  arguments  (variables,  individuals  or  data  values).12  The 
example  rule  given  in  Section  7.1  (that  equates  the  style/period  of  art  objects  with  the 
style  of  the  artist  that  created  them)  would  be  mapped  into  RDF  as  follows: 

<owlr:Variable  rdf:ID="x"/> 

<owlr:Variable  rdf:ID="y"/> 

<owlr:Variable  rdf:ID="z"/> 

<owlr:Rule> 

<owlr:antecedent  rdf:parseType=  "Collection "> 

<owlr:classAtom> 

<owlr:classPredicate 

rdf:resource="&ulan;Artist"/> 

<owlr:argument1  rdf:resource="#x"  /> 

</owlr:classAtom> 

<owlr:classAtom> 

<owlr:classPredicate 

rdf  :resource= "  &aat;Sty  le "  /  > 

<owlr:argument1  rdf:resource="#y"  /> 

</owlr:classAtom> 

<owlr:individualPropertyAtom> 

<owlr:propertyPredicate 

rdf:resource="&aatulan;artistStyle"/> 

<owlr:argument1  rdf:resource="#x"  /> 

<owlr:argument2  rdf:resource="#y"  /> 
</owlr:individualPropertyAtom> 

<owlr:individualPropertyAtom> 

<owlr:propertyPredicate 

rdf:resource="&vra;creator"/> 

<owlr:argument1  rdf:resource="#x"  /> 

<owlr:argument2  rdf:resource="#z"  /> 
</owlr:individualPropertyAtom> 

</owlr:antecedent> 

<owlr:consequent  rdf:parseType=  "Collection "> 
<owlr:individualPropertyAtom> 

<owlr:propertyPredicate 

rdf:resource="&vra;style/period  "/> 

<owlr:argument1  rdf:resource="#z"  /> 

<owlr:argument2  rdf:resource="#y"  /> 
</owlr:individualPropertyAtom> 

</owlr:consequent> 

</owlr:Rule> 

where  &ulan;.  &aat;,  &aatulan;  and  &vra;  are  assumed  to  expand  into  the  appropriate 

11 http : //www . w3 . org/TR/ owl- xml syntax/ owlxml2rdf . xsl 

1  -The  result  is  similar  to  the  RDF  syntax  for  representing  disjunction  and  quantifiers  proposed  in  [30]. 
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namespace  names.  Note  that  complex  OWL  classes  (such  as  OWL  restrictions)  as  well 
as  class  names  can  be  used  as  the  object  of  SWRL’s  classPredicate  property. 


9  Reasoning  Support  for  SWRL 

Although  SWRL  provides  a  fairly  minimal  rule  extension  to  OWL,  the  consistency 
problem  for  SWRL  ontologies  is  still  undecidable  (as  we  have  seen  in  Section  6).  This 
raises  the  question  of  how  reasoning  support  for  SWRL  might  be  provided. 

It  seems  likely,  at  least  in  the  first  instance,  that  many  implementations  will  provide 
only  partial  support  for  SWRL.  For  this  reason,  users  may  want  to  restrict  the  form  or 
expressiveness  of  the  rules  and/or  axioms  they  employ  either  to  fit  within  a  tractable 
or  decidable  fragment  of  SWRL,  or  so  that  their  SWRL  ontologies  can  be  handled  by 
existing  or  interim  implementations. 

One  possible  restriction  in  the  form  of  the  rules  is  to  limit  antecedent  and  conse¬ 
quent  classAtoms  to  be  named  classes,  with  OWL  axioms  being  used  to  assert  addi¬ 
tional  constraints  on  the  instances  of  these  classes  (in  the  same  document  or  in  external 
OWL  documents).  Adhering  to  this  format  should  make  it  easier  to  translate  rules  to  or 
from  existing  (or  future)  rule  systems,  including  Prolog,  production  rules  (descended 
from  OPS5),  event-condition-action  rules  and  SQL  (where  views,  queries,  and  facts 
can  all  be  seen  as  rules);  it  may  also  make  it  easier  to  extend  existing  rule  based  rea- 
soners  for  OWL  (such  as  Euler13  or  FOWL14)  to  handle  SWRL  ontologies.  Further, 
such  a  restriction  would  maximise  backwards  compatibility  with  OWL-speaking  sys¬ 
tems  that  do  not  support  SWRL.  It  should  be  pointed  out,  however,  that  there  may  be 
some  incompatibility  between  the  first  order  semantics  of  SWRL  and  the  Herbrand 
model  semantics  of  many  rule  based  reasoners. 

By  further  restricting  the  form  of  rules  and  DL  axioms  used  in  SWRL  ontologies  it 
would  be  possible  to  stay  within  DLP,  a  subset  of  the  language  that  has  been  shown  to 
be  expressible  in  either  OWL  DL  or  declarative  logic  programs  (LP)  alone  [14].  This 
would  allow  either  OWL  DL  reasoners  or  LP  reasoners  to  be  used  with  such  ontologies, 
although  there  may  again  be  some  incompatibility  between  the  semantics  of  SWRL  and 
those  of  LP  reasoners. 

Another  obvious  strategy  would  be  to  restrict  the  form  of  rules  and  DL  axioms 
so  that  a  “hybrid"  system  could  be  used  to  reason  about  the  resulting  ontology.  This 
approach  has  been  used,  e.g.,  in  the  CLASSIC  [38]  and  CARIN  systems  [28],  where 
sound  and  complete  reasoning  is  made  possible  mainly  by  focusing  on  query  answer¬ 
ing,  by  restricting  the  DL  axioms  to  languages  that  are  much  weaker  than  OWL,  by 
restricting  the  use  of  DL  terms  in  rules,  and/or  by  giving  a  different  semantic  treatment 
to  rules. 

Finally,  an  alternative  way  to  provide  reasoning  support  for  SWRL  would  be  to 
extend  the  translation  of  OWL  into  TPTP13  implemented  in  the  Hoolet  system,16  and 
use  a  first  order  prover  such  as  Vampire  to  reason  with  the  resulting  first  order  theory 
[42,  54].  This  technique  would  have  several  advantages:  no  restrictions  on  the  form  of 
SWRL  rules  or  axioms  would  be  required;  the  use  of  a  first  order  prover  would  ensure 
that  all  inferences  were  sound  with  respect  to  SWRL’s  first  order  semantics;  and  the 
use  of  the  TPTP  syntax  would  make  it  possible  to  use  any  one  of  a  range  of  state  of  the 

13 http  :  //www .  agf  a  .  com/w3c/ euler/ 

14http  :  /  /fowl .  sourcef orge  .  net 

i5A  standard  syntax  used  by  many  first  order  theorem  provers — see  http :  //www .  tptp .  org 
16http : //www . w3 . org/2 003/ 08/owl- systems /test- result s-out 
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art  first  order  provers.  A  prototype  based  on  this  approach  is  described  in  the  following 
section. 


10  A  Prototype  SWRL  Reasoner 

It  is  well  known  that  OWL  DL  corresponds  to  the  STLOXMV~  Description  Logic 
(DL),  and  that,  like  most  other  DLs,  SHOIAfV~  is  a  fragment  of  classical  first-order 
predicate  logic  (FOL)  [10,  19,  1].  This  suggests  the  idea  of  using  standard  methods  of 
automated  reasoning  for  FOL  as  a  mechanism  for  reasoning  with  OWL  DL. 

This  might  be  done  by  trying  to  create  from  scratch  new  architectures  for  reasoning 
in  FOL,  which  would  be  specialised  for  dealing  efficiently  with  typical  DL  reasoning 
tasks.  A  much  less  expensive  option  is  to  use  existing  implementations  of  FOL  provers, 
with  the  possibility  of  making  adjustments  that  exploit  the  structure  of  DL  reasoning 
tasks.  An  additional  attraction  of  using  a  FO  prover  in  this  way  is  the  fact  that  the  trans¬ 
lation  from  DL  to  FOL  can  be  extended  to  handle  SWRL,  providing  an  implementation 
of  a  SWRL  reasoner. 

Here  we  describe  our  initial  prototype  implementation  of  just  such  a  SWRL  rea¬ 
soner,  known  as  Hoolet.  It  should  be  noted  that  this  initial  implementation  is  rather 
simplistic,  and  is  only  intended  as  a  preliminary  feasibility  study.  We  will,  however, 
discuss  the  issue  of  possible  optimisations. 

There  have  been  earlier  investigations  of  the  use  of  FOL  provers  to  reason  with 
description  logics.  Paramasivam  and  Plaisted,  for  example,  have  investigated  the  use 
of  FOL  reasoning  for  DL  classification  [36],  while  Ganzinger  and  de  Nivelle  have 
developed  decision  procedures  for  the  guarded  fragment,  a  fragment  of  FOL  that  in¬ 
cludes  many  description  logics  [11],  The  most  widely  known  work  in  this  area  was  by 
Hustadt  and  Schmidt  [26],  who  used  the  SPASS  FOL  prover  to  reason  with  proposi¬ 
tional  modal  logics,  and,  via  well  known  correspondences  [45],  with  description  logics. 
Their  technique  involved  the  use  of  a  relatively  complex  functional  translation  which 
produces  a  subset  of  FOL  for  which  SPASS  can  be  tuned  so  as  to  guarantee  complete 
reasoning.  The  results  of  this  experiment  were  quite  encouraging,  with  performance 
of  the  SPASS  based  system  being  comparable,  in  many  cases,  with  that  of  state  of  the 
art  DL  reasoners.  The  tests,  however,  mainly  concentrated  on  checking  the  satisfiabil¬ 
ity  of  (large)  single  modal  logic  formulae  (equivalently,  OWL  class  descriptions/DL 
concepts),  rather  than  the  more  interesting  task  (in  an  ontology  reasoning  context)  of 
checking  the  satisfiability  of  formulae  w.r.t.  a  large  theory  (equivalently,  an  OWL  on- 
tology/DL  knowledge  base). 

In  all  of  the  above  techniques,  the  DL  is  translated  into  (the  guarded  fragment  of) 
FOL  in  such  a  way  that  the  prover  can  be  used  as  a  decision  procedure  for  the  logic — 
i.e.,  reasoning  is  sound,  complete  and  terminating.  Such  techniques  have,  however,  yet 
to  be  extended  to  the  more  expressive  DLs  that  underpin  Web  ontology  languages  such 
as  DAML+OIL  and  OWL  DL  [18],  and  it  is  not  even  clear  if  such  an  extension  would 
be  possible. 

An  alternative  approach,  and  the  one  we  describe  here,  is  to  use  a  simple  “direct” 
translation  based  on  the  standard  first  order  semantics  of  DLs  (see,  e.g.,  [1]).  Using 
this  approach,  an  ontology/knowledge  base  (a  set  of  DL  axioms),  is  translated  into  a  FO 
theory  (a  set  of  FO  axioms).  A  DL  reasoning  task  w.r.t.  the  knowledge  base  (KB)  is  then 
transformed  into  a  FO  task  that  uses  the  theory.  Unlike  methods  such  as  Hustadt  and 
Schmidt’s  functional  translation,  this  does  not  result  in  a  decision  procedure  for  the  DL. 
The  direct  translation  approach  can,  however,  be  used  to  provide  reasoning  services 
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(albeit  without  any  guarantee  of  completeness)  for  the  expressive  DLs  underlying  Web 
ontology  languages,  DLs  for  which  no  effective  decision  procedure  is  currently  known. 
Moreover,  the  translation  approach  can  easily  deal  with  language  extensions  such  as 
SWRL  as  described  here. 

In  recent  years,  a  number  of  highly  efficient  FO  provers  have  been  imple¬ 
mented  [32,  50,  43].  These  provers  compete  annually  on  a  set  of  tasks,  and  the  re¬ 
sults  are  published  [9].  One  of  the  most  successful  general-purpose  provers  has  been 
Vampire  [43],  and  we  have  chosen  this  prover  to  use  in  our  prototype. 

Vampire  is  a  general-purpose  FOL  prover  developed  by  Andrei  Voronkov  and 
Alexandre  Riazanov.  Given  a  set  of  first-order  formulas.  Vampire  transforms  it  into  an 
equisatisfiable  set  of  clauses,  and  then  tries  to  demonstrate  inconsistency  of  the  clause 
set  by  saturating  it  with  ordered  resolution  and  superposition  (see  [3,  33]).  If  the  satura¬ 
tion  process  terminates  without  finding  a  refutation  of  the  input  clause  set,  it  indicates 
that  the  clause  set,  and  therefore  the  original  formula  set,  is  satisfiable,  provided  that 
the  variant  of  the  calculus  used  is  refutationally  complete  and  that  a  fair  strategy17  has 
been  used  for  saturation. 

The  main  input  format  of  Vampire  is  the  TPTP  syntax  [49]  (although  a  parser  for 
a  subset  of  KIF  [12]  has  been  added  recently).  Using  the  TPTP  syntax  in  our  prototype 
means  that  it  would  be  possible  to  substitute  Vampire  with  any  one  of  a  range  of  state 
of  the  art  first  order  provers. 


10.1  Translation  issues 


Translating  OWL  Ontologies  into  FOL  Axioms  We  will  only  discuss  the  transla¬ 
tion  from  DL  to  FOL  as  the  correspondence  between  OWL  DL  and  SHO!AfV~  is 
well  known  [19].  The  translation  (j>  maps  DL  concepts  C  and  role  names  R  into  unary 
and  binary  predicates  4>c{x)  and  <f>R(x,  y)  respectively.  Complex  concepts  and  axioms 
are  mapped  into  FO  formulae  and  axioms  in  the  standard  way  [7,  1].  For  example, 
subsumption  and  equivalence  axioms  are  translated  into,  respectively,  FO  implication 
and  equivalence  (with  the  free  variables  universally  quantified). 

As  an  example,  let’s  see  a  translation  of  a  couple  of  concept  and  role  axioms: 


DL 

R  C  S 

C  =  D\l3R.{EU 
\/S~.F) 

A  C  ^3  R.B 


Transitive  (T) 


FOL 

VxVy((f>R(x,y)  ->  <j>s{x,y)) 

Vx(<j>c{x)  =  <j>D(x)  A  3 y(<j>n{x,y)  A  (<j>E{y)  V 
\/x(<j)S(x,y)  A<I>f(x))))) 

Vx((j) A{x)  -»  3y13y23y3{(j>R{x,y1)  A  <j>B{yi)  A 
<I>r(x,  yi)  A  <Mv 2)  A  4>r(x,  y3)  A  0B(y3)  A 
(' yi  +  y-2.)  a  ( y2  +  2/3)  A  (2/1  y3)) 

VxVyVz(<j>T(x,  y)  A  <j>T(y,  z)  ->  <j>T{x,  z)) 


Simple  DLs  (like  ACC)  can  be  translated  into  the  FOL  class  C2  (the  FOL  fragment 
with  no  function  symbols  and  only  2  variables),  which  is  known  to  be  decidable  [31]. 
The  above  translations  of  the  role  inclusion  axiom  and  concept  equality  axiom  are, 
for  example,  in  C2 .  When  number  restrictions  are  added  to  these  DLs,  they  can  be 
translated  into  C2 — equivalent  to  C2  with  additional  “counting  quantifiers” — which  is 
also  known  to  be  decidable  [13]. 

The  FOL  translation  of  more  expressive  description  logics,  e.g.,  with  transitive 
roles  ( ST11Q ,  OWL  Lite  and  OWL  DL)  and/or  complex  role  axioms  (1ZIQ  [22]), 

17I.e.,  all  generated  clauses  are  eventually  processed 
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may  lead  to  the  introduction  of  three  or  more  variables.18  The  above  transitivity  ax¬ 
iom  for  role  T  is  an  example  of  this  case.  FOL  with  three  variables  is  known  to  be 
undecidable  [7]. 

OWL  DL  also  provides  for  XML  schema  datatypes  [6],  equivalent  to  a  very  simple 
form  of  concrete  domains  [21].  The  minimum  requirement  for  OWL  DL  reasoners  is 
that  they  support  xsd :  integer  and  xsd:  string  datatypes,  where  support  means 
providing  a  theory  of  (in)equality  for  integer  and  string  values  [37]. 

Our  translation  encodes  the  required  datatype  theory  by  mapping  datatypes  into 
predicates  and  data  values  into  new  constants.  Lexically  equivalent  data  values  are 
mapped  to  the  same  constant,  with  integers  first  being  canonicalised  in  the  obvious 
way,  and  axioms  are  added  that  assert  inequality  between  all  the  string  and  integer  data 
constants  introduced.  If  a  data  value  DV  and  a  datatype  DT  are  mapped  to  DV  and 
DT  respectively,  and  DV  is  of  type  DT,  then  an  axiom  DT(DV)  is  also  added.  As 
the  xsd :  integer  and  xsd :  string  interpretation  domains  are  disjoint,  we  add  an 
axiom  to  that  effect.  Finally,  we  add  an  axiom  asserting  the  disjointness  of  the  datatype 
domain  (the  set  of  data  values)  and  the  abstract  domain  (the  set  of  individuals). 

In  accordance  with  the  OWL  DL  semantics,  other  “unsupported"  data  types  are 
treated  opaquely,  i.e.,  data  values  are  mapped  to  the  same  constant  if  they  are  lexically 
identical,  but  no  other  assumptions  are  made  (we  do  not  assume  inequality  if  the  lexical 
forms  are  not  identical)  [37]. 

Translating  SWRL  Rules  into  FOL  Axioms  Using  the  translation  approach,  we 
can  easily  extend  the  first-order  translation  to  SWRL  rules  and  thus  provide  a  simple 
implementation  of  a  SWRL  reasoner. 

As  we  have  seen,  rules  in  SWRL  are  of  the  form: 

B\ , . . . ,  -6m  ■  H\ , . . . ,  Hn 

where  each  of  the  Bi  or  Hj  are  rule  atoms.  Possible  rule  atoms  are  shown  in  Table  2, 
where  C  is  an  OWL  class  description,  R  an  OWL  property  and  i  and  j  are  either  OWL 
individual  names  or  SWRL  variables. 


Table  2:  Rule  Atoms 


Atom 

Type 

C(i) 

Class  Atom 

R(i, j) 

Property  Atom 

i==j 

Equality  Atom 

i!=j 

Inequality  Atom 

In  our  prototype  we  have  only  considered  a  simplification  of  SWRL  where  C  must 
be  a  class  name  (rather  than  arbitrary  class  descriptions),  and  R  must  be  an  object  prop¬ 
erty.  The  first  of  these  restrictions  does  not  affect  the  expressiveness  of  the  language,  as 
new  class  names  can  be  introduced  into  the  ontology  to  represent  any  complex  descrip¬ 
tions  required  in  rules.  The  restriction  to  object  properties  simplifies  our  implemen¬ 
tation,  but  the  translation  we  describe  could  easily  be  extended  to  handle  data  valued 
properties. 

The  translation  of  rules  exactly  follows  the  semantics  of  the  rules  as  given  in  Sec¬ 
tion  4.  Each  rule  is  translated  as  an  implication,  and  any  free  variables  in  the  rule  are 

!  H I n  some  cases,  the  effects  of  transitive  roles  can  be  axiomatised  in  C2  [53]. 
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assumed  to  be  universally  quantified.  Thus  a  rule: 


B  i 


.  Bn 


Hi 


.  H„ 


is  translated  to  an  axiom: 

Vxi,x2, . .  .,xk.T(B1)  A  ...  A  T(Bm)  -a  T(iii)  A  ...  A  T(Hn) 

where  xi,  x%,  . . ,  xk  are  all  the  variables  occurring  in  the  Bi  and  Hj. 

Translation  of  atoms  is  trivial  and  is  shown  in  Table  3.  Combining  this  translation 
with  the  translation  from  OWL  to  FOL  described  above  provides  us  with  a  prototype 
implementation  of  a  SWRL  reasoner.  Given  an  ontology  and  a  collection  of  rules 
relating  to  that  ontology,  we  translate  the  ontology  to  FOL,  and  then  add  the  FOL 
axioms  generated  by  translating  the  rules.  The  resulting  theory  is  passed  to  a  FO  prover 
(Vampire  in  our  case),  where  it  can  be  used  for  reasoning  tasks  such  as  satisfiability 
checking  and  instance  checking. 


Table  3:  Rule  Atom  Translation 


Atom 

Translation 

C(i) 

C(i) 

R (if  j ) 

i==  j 

i=j 

i!=j 

10.2  Examples 

As  an  example,  we  will  consider  a  variant  on  the  "uncle”  example  given  in  Section  3.2: 

hasParent ( ?x, ?y) ,  hasSibling ( ?y ,  ?z) ,  Male(?z) 

=k  hasUncle  (?x,  ?z) 

If  our  ontology  additionally  includes  the  axiom  and  facts  (expressed  here  using  stan¬ 
dard  DL  syntax): 

Uncle  =  3hasUncle“.T 
(Robert, Paul)  :  hasParent 
(Paul, Ian)  :  hasSibling 

then  the  reasoner  can  infer  not  only  hasUncle(Robert,  Ian),  but  also  that  Ian  is 
an  instance  of  the  Uncle  class. 

Another  interesting  aspect  of  the  language  is  illustrated  by  the  following  rule: 

Beer(?x)  =>  Happy (Sean) 

This  expresses  the  fact  that  for  any  instances  of  the  class  Beer,  Sean  must  be  an 
instance  of  Happy.  This  effectively  allows  us  to  express  an  existential  quantification 
over  the  class  Beer:  if  we  can  prove  the  existence  of  an  instance  of  this  class,  then 
Sean  will  be  Happy.  Note  that  we  do  not  actually  have  to  provide  a  name  for  such  an 
instance.  For  example,  if  our  ontology  includes  the  fact: 

Sean  :  Bdrinks.Beer 

then  the  reasoner  can  infer  that  Sean  must  be  Happy  as  we  now  know  that  there  exists 
some  instance  of  Beer — even  though  this  instance  is  unnamed. 


20 


10.3  Performance  and  Optimisation 

Our  prototype  works  well  with  small  examples,  such  as  those  given  in  Sections  7 
and  10.2,  and  we  have  used  it  successfully  with  SWRL  ontologies  containing  up  to 
100  axioms,  rules  and  facts.  However,  while  it  is  useful  to  have  a  prototype  that  can  be 
used  for  illustrative  and  test  purposes,  the  effectiveness  of  such  a  naive  approach  must 
be  open  to  question  with  larger  SWRL  ontologies. 

In  [55]  it  was  shown  that,  when  using  the  same  translation  approach  to  reason  with 
OWL  DL  ontologies,  performance  could  be  greatly  improved  by  using  a  so-called  “rel¬ 
evant  only”  translation.  The  key  idea  is  that  when  ontologies  are  translated.  Vampire 
receives  all  of  the  axioms  that  occur  in  the  ontology,  whereas  usually  only  a  small  frac¬ 
tion  of  them  are  actually  relevant  to  a  given  subsumption  or  inconsistency  problem. 
Vampire  is  not  optimised  to  deal  efficiently  with  large  numbers  of  irrelevant  axioms, 
and  so  it  does  not  perform  well  under  these  circumstances. 

An  obvious  way  to  correct  this  situation  is  to  remove  all  irrelevant  information  from 
the  FO  task  given  to  Vampire.  An  axiom  is  said  to  be  irrelevant  to  a  consistency  test  of 
C  if  it  can  easily  be  shown  (i.e.,  via  a  syntactic  analysis)  that  removing  it  from  the  on¬ 
tology  would  not  affect  the  interpretation  of  C;  other  axioms  are  called  relevant.  Note 
that  not  every  “relevant  axiom”  really  will  affect  the  computation  of  the  consistency 
of  C,  but  we  cannot  (easily)  rule  out  the  possibility  that  it  may  affect  the  computation. 
An  FO-translation  is  called  relevant-only  if  it  contains  only  FO-translations  of  axioms 
relevant  (in  the  above  sense)  to  the  given  satisfiability  test. 

The  definition  of  relevance  given  in  [55]  can  be  extended  to  SWRL  by  treating 
rules  in  the  same  way  as  general  concept  inclusion  axioms  (GCIs).  A  concept  or  role 
expression  depends  on  every  concept  or  role  that  occurs  in  it,  and  a  concept  or  role  C 
depends  on  a  concept  or  role  D  if  D  occurs  in  the  definition  of  C.  In  addition,  a  concept 
C  depends  on  every  GCI  and  rule  in  the  ontology.19  Relevance  is  the  transitive  closure 
of  depends.  The  process  of  selecting  information  relevant  to  a  concept  expression  E 
looks  very  much  the  same  as  unfolding  (see  [1]),  and  assumes  that  the  KB  is  separated 
into  a  set  of  unfoldable  axioms  and  a  set  of  GCIs  [25]  and  rules.  Every  concept  name 
CN  and  role  name  RN  appearing  in  E  is  relevant  to  E.  The  process  is  then  repeated 
recursively  for  unfoldable  axioms  with  CN  on  the  left  hand  side  (whether  inclusion 
or  equality  axioms).  Also,  if  role  R  is  relevant  to  E,  then  so  are  all  roles  R'  s.t.  R  C 
R! ,  along  with  their  inverses  (if  the  target  DL  allows  inverse  roles).  An  algorithm 
for  computing  relevant  information  is  quite  straightforward  and  is  described  in  detail 
in  [54], 

Computing  relevance  leads  to  a  small  overhead  when  translating  a  SWRL  ontology 
into  FOL,  but  it  should  greatly  increase  the  performance  of  the  FO  prover.  Preliminary 
experiments  with  an  extension  of  Hoolet  to  include  an  implementation  of  the  relevant 
only  translation  suggest  that  this  is  indeed  the  case  [29]. 

11  Discussion 

In  this  paper  we  have  presented  SWRL,  a  proposed  extension  to  OWL  to  include  a 
simple  form  of  Horn-style  rules.  We  have  provided  formal  syntax  and  semantics  for 
SWRL,  shown  how  OWL's  XML  and  RDF  syntax  can  be  extended  to  deal  with  SWRL, 

19It  should  be  possible  to  treat  (some)  rules  as  unfoldable  axioms,  add  thus  eliminate  the  need  to  include 
all  rules  in  the  relevant  only  translation,  but  this  is  still  the  subject  of  ongoing  work. 
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illustrated  the  features  of  SWRL  with  several  examples,  and  discussed  how  reasoning 
support  for  SWRL  might  be  provided. 

The  main  strengths  of  the  proposal  are  its  simplicity  and  its  tight  integration  with 
the  existing  OWL  language.  As  we  have  seen,  SWRL  extends  OWL  with  the  most  ba¬ 
sic  kind  of  Horn  rule  (sweetened  with  a  little  “syntactic  sugar”):  predicates  are  limited 
to  being  OWL  classes  and  properties  (and  so  have  a  maximum  arity  of  2),  there  are  no 
disjunctions  or  negations  (of  atoms),  no  built  in  predicates  (such  as  arithmetic  predi¬ 
cates),  and  no  nonmonotonic  features  such  as  negation  as  failure  or  defaults.  Moreover, 
rules  are  given  a  standard  first  order  semantics.  This  facilitates  the  tight  integration  with 
OWL,  with  SWRL  being  defined  as  a  syntactic  and  semantic  extension  of  OWL  DL. 

While  we  believe  that  SWRL  defines  a  natural  and  useful  level  in  the  hierarchy  of 
Semantic  Web  languages,  it  is  clear  that  some  applications  would  benefit  from  further 
extensions  in  expressive  power.  In  particular,  the  ability  to  express  arithmetic  relation¬ 
ships  between  data  values  is  important  in  many  applications  (e.g.,  to  assert  that  persons 
whose  income  at  least  equals  their  expenditure  are  happy,  while  those  whose  expendi¬ 
ture  exceeds  their  income  are  unhappy).  It  is  not  clear,  however,  if  this  would  best  be 
achieved  by  extending  SWRL  to  include  rules  with  built  in  arithmetic  predicates,  or  by 
extending  OWL  Datatypes  to  include  nary  predicates  [35], 

Finally,  we  have  shown  how  a  first  order  theorem  prover  can  be  used  to  provide 
reasoning  services  for  SWRL,  and  how  some  simple  optimisations  can  be  used  to  im¬ 
prove  performance.  Our  results  were  sufficiently  encouraging  to  suggest  that,  with 
further  tuning  and  optimisation,  such  a  strategy  would  be  useful  in  (some)  realistic  ap¬ 
plications.  Future  work  will  include  such  tuning  and  optimisation,  as  well  as  empirical 
investigations  to  determine  the  practical  value  of  the  resulting  system. 
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Introducing  Customised  Datatypes  and 
Datatype  Predicates  into  OWL**) 


Jeff  Z.  Pan  and  Ian  Horrocks 
School  of  Computer  Science.  University  of  Manchester,  UK 


Abstract.  Although  OWL  is  rather  expressive,  it  has  a  very  serious  limitation  on 
datatypes;  i.e.,  it  does  not  support  customised  datatypes.  It  has  been  pointed  out 
that  many  potential  users  will  not  adopt  OWL  unless  this  limitation  is  overcome, 
and  the  W3C  Semantic  Web  Best  Practices  and  Deployment  Working  Group 
has  set  up  a  task  force  to  address  this  issue.  This  paper  provides  a  solution  for 
this  issue  by  presenting  two  decidable  datatype  extensions  of  OWL  DL,  namely 
OWL-Eu  and  OWL-E.  OWL-Eu  provides  a  minimal  extension  of  OWL  DL  to 
support  customised  datatypes,  while  OWL-E  extends  OWL  DL  with  both  cus¬ 
tomised  datatypes  and  customised  datatype  predicates. 


1  Introduction 

The  OWL  Web  Ontology  Language  [1]  is  a  W3C  recommendation  for  expressing  on¬ 
tologies  in  the  Semantic  Web.  Datatype  support  [7,  8]  is  one  of  the  key  features  that 
OWL  is  expected  to  provide,  and  has  prompted  extensive  discussions  in  the  RDF-Logic 
mailing  list  [10]  and  in  the  Semantic  Web  Best  Practices  mailing  list  [12].  Although 
OWL  adds  considerable  expressive  power  to  the  Semantic  Web,  the  OWL  datatype  for¬ 
malism  (or  simply  OWL  datatyping)  is  much  too  weak  for  many  applications;  in  partic¬ 
ular,  OWL  datatyping  does  not  provide  a  general  framework  for  customised  datatypes, 
such  as  XML  Schema  derived  datatypes. 

It  has  been  pointed  out  that  many  potential  users  will  not  adopt  OWL  unless  this 
limitation  is  overcome  [11],  as  it  is  often  necessary  to  enable  users  to  define  their  own 
datatypes  and  datatype  predicates  for  their  ontologies  and  applications.  One  of  the  most 
well  known  type  systems  is  W3C  XML  Schema  Part  2  [2],  which  defines  facilities  to 
allow  users  to  define  customised  datatypes,  such  as  those  defined  by  imposing  some 
restrictions  in  the  value  spaces  of  existing  datatypes. 

Example  1.  Customised  datatypes  are  useful  in  capturing  the  intended  meaning  of  some 
vocabulary  in  ontologies.  For  example,  users  might  want  to  use  the  customised  datatype 
‘atLeastl8’  in  the  following  definition  of  the  class  ‘Adult’: 

Class  (Adult  complete  Person 

restriction  ( age  allValuesFrom  (atLeastl8)  )  )  , 

which  says  that  an  Adult  is  a  Person  whose  age  is  at  least  18.  The  datatype  constraint 

This  work  is  partially  supported  by  the  FP6  Network  of  Excellence  EU  project  Knowledge 
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‘at  least  18’  can  be  defined  as  an  XML  Schema  user-defined  datatype 
<simpleType  name  =  “atLeastl8”> 

<restriction  base  =  “xsd : integer” > 

<minlnclusive  value  =  “18” /> 

</restriction> 

</simpleType> 

in  which  the  facet  ‘minlnclusive’  is  used  to  restrict  the  value  space  of  atLeastl8  (a 
customised  datatype)  to  be  a  subset  of  the  value  space  of  integer  (an  XML  Schema 
built-in  datatype). 

User-defined  datatypes  (like  the  above  one)  cannot,  however,  be  used  in  the  OWL 
datatyping,  which  (only)  provides  the  use  of  some  built-in  XML  Schema  datatypes  and 
enumerated  datatypes,  which  are  defined  by  explicitly  specifying  their  instances.  The 
OWL  datatyping  does  not  support  XML  Schema  customised  datatypes  for  the  following 
two  reasons:  (i)  XML  Schema  does  not  provide  a  standard  way  to  access  a  user-defined 
datatype,  (ii)  OWL  DL  does  not  provide  a  mechanism  to  guarantee  the  computability 
of  the  kinds  of  customised  datatypes  it  supports. 

This  paper  provides  a  solution  for  this  issue  by  presenting  two  decidable  datatype 
extensions  of  OWL  DL,  namely  OWL-Eu  and  OWL-E.  OWL-Eu  provides  a  minimal 
extension  of  OWL  DL  to  support  customised  datatypes,  while  OWL-E  extends  OWL 
DL  with  both  customised  datatypes  and  customised  datatype  predicates.  The  rest  of  the 
paper  is  organised  as  follows:  Section  2  further  discusses  the  motivations  of  introducing 
customised  datatypes  and  datatype  predicates.  Section  3  extends  the  OWL  datatyping 
to  unary  datatype  groups,  which  enables  the  use  of  customised  datatypes.  Section  4 
and  5  present  the  OWL-Eu  and  the  OWL-E  languages,  respectively;  the  latter  one  is 
based  on  datatype  groups,  which  are  general  forms  of  unary  datatype  groups.  Section  6 
concludes  the  paper  and  suggests  some  future  work. 


2  Motivations 

Allowing  users  to  define  their  own  vocabulary  is  one  of  the  most  useful  features  that 
ontologies  can  provide  over  other  approaches,  such  as  the  Dublin  Core,  of  providing 
semantics  in  the  Semantic  Web.  In  the  Dublin  Core  standard,  the  meaning  of  the  set 
of  15  information  properties  are  described  in  English  text.  The  main  drawback  of  the 
Dublin  Core  is  its  inflexibility;  it  is  impossible  to  ‘predefine’  information  properties  for 
all  sorts  of  applications. 

Ontologies,  however,  are  more  flexible  in  that  users  can  define  their  own  vocabu¬ 
lary  based  on  existing  vocabularies.  In  ontology  languages,  a  set  of  class  constructors 
are  usually  provided  so  that  users  can  build  class  expressions  based  on,  for  example,  ex¬ 
isting  class  names.  The  intended  meaning  of  the  vocabulary,  therefore,  can  be  captured 
by  the  axioms  in  the  ontologies.  Let  us  revisit  Example  1  and  consider  the  intended 
meaning  of  the  Adult  class.  According  to  its  definition,  an  Adult  is  a  Person  who  is  at 
least  18  years  old.  As  a  result,  programs  can  also  understand  the  meaning  of  customised 
vocabulary,  with  the  help  of  ontologies. 

Although  OWL  DL  provides  a  set  of  expressive  class  constructors  to  build  cus¬ 
tomised  classes,  it  does  not  provide  enough  expressive  power  to  support,  for  example, 
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XML  Schema  customised  datatypes.  In  order  to  capture  the  intended  meaning  of  Adult, 
Example  1  has  already  shown  the  necessity  of  customised  datatypes.  In  what  follows, 
we  give  some  more  examples  to  illustrate  the  usefulness  of  customised  datatypes  and 
datatype  predicates  in  various  SW  and  ontology  applications. 

Example  2.  Semantic  Web  Service:  Matchmaking 

Matchmaking  is  a  process  that  takes  a  service  requirement  and  a  group  of  service 
advertisements  as  input,  and  returns  all  the  advertisements  that  may  potentially  satisfy 
the  requirement.  In  a  computer  sales  ontology,  a  service  requirement  may  ask  for  a  PC 
with  memory  size  greater  than  512Mb,  unit  price  less  than  700  pounds  and  delivery 
date  earlier  than  15/03/2004. 

Here  ‘greater  than  5 12’,  ‘less  than  700’  and  ‘earlier  than  15/03/2004’  are  customised 
datatypes  of  base  datatypes  integer,  integer  and  date,  respectively. 

Example  3.  Electronic  Commerce:  A  ‘No  Shipping  Fee’  Rule 

Electronic  shops  may  need  to  classify  items  according  to  their  sizes,  and  to  reason 
that  an  item  for  which  the  sum  of  height,  length  and  width  is  no  greater  than  15cm 
belongs  to  a  class  in  their  ontology,  called  ‘small-items’.  Then  they  can  have  a  rule 
saying  that  for  ‘small-items’  no  shipping  costs  are  charged.  Accordingly,  the  billing 
system  will  charge  no  shipping  fees  for  all  the  instances  of  the  ‘small-items’  class. 

Here  ‘greater  than  15’  is  a  customised  datatype,  ‘sum’  is  a  datatype  predicate,  while 
‘sum  no  greater  than  15’  is  a  customised  datatype  predicate. 

3  Unary  Datatype  Groups 

The  OWL  datatyping  is  defined  based  on  the  notion  of  datatype  maps  [9],  A  datatype 
map  is  a  partial  mapping  from  supported  datatype  URIrefs  to  datatypes.  In  this  section, 
we  introduce  unary  datatype  groups,  which  extend  the  OWL  datatyping  with  a  hierarchy 
of  supported  datatypes. 

Definition  1  A  unary  datatype  group  Q  is  a  triple  (M,;,B,dom),  where  M,/  is  the 
datatype  map  of  Q.  B  is  the  set  of  primitive  base  datatype  URI  references  in  Q  and 
dom  is  the  declared  domain  function.  We  call  S  the  set  of  supported  datatype  URI  ref¬ 
erences  of  Q,  i.e.,  for  each  a  £  S,  is  defined;  we  require  BCS.  We  assume  that 

there  exist  unary  datatype  URI  reference  rdfs:  Literal,  owlx:DatatypeBottom  ^  S. 
The  declared  domain  function  dom  has  the  following  properties;  for  each  u  £  S,  if 
u  £  B,  dom(u)  =  u\  otherwise,  dom(u)  =  v,  where  v  £  B.  o 

Definition  1  ensures  that  all  the  primitive  base  datatype  URIrefs  of  Q  are  supported 
(B  C  S)  and  that  each  supported  datatype  URIref  relates  to  a  primitive  base  datatype 
URIref  through  the  declared  domain  function  dom. 

Example  4.  Q\  =  (M^,  Bi,  domi)  is  a  unary  datatype  group,  where 

-  M,; i  =  {xsd:integer  i— >  integer,  xsdistring  i— ►  string,  xsdinonNegativelnteger 

h- >>o,  xsdx:integerLessThanN  i— ><N}, 

-  Bi  =  {xsdistring,  xsd:integer},  and 
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-  domi  =  {xsd:integer  h- >  xsd:integer,  xsd:string  i— >  xsd:string,  xsdinonNega- 
tivelnteger  i— »  xsd: integer,  xsdx: integer LessThanN  i— >  xsd: integer}. 

According  to  M^,  we  have  Si  =  {xsd : integer, xsd: string, xsd: nonNega- 
tive Integer,  xsdx:integerLessThanN},  hence  Bj  C  Sj.  <C> 

Based  on  a  unary  datatype  group,  we  can  provide  a  formalism  (called  datatype  ex¬ 
pressions)  for  constructing  customised  datatypes  using  supported  datatypes. 

Definition  2  Let  Q  be  a  unary  datatype  group.  The  set  of  Q -unary  datatype  expres¬ 
sions  in  abstract  syntax  (corresponding  DL  syntax  can  be  found  in  Table  5  on  page  8), 
abbreviated  Dexpft/j,  is  inductively  defined  as  follows: 

1.  atomic  expressions  u  £  Dexp(C/),  for  a  datatype  URIref  it; 

2.  relativised  negated  expressions  not  (it)  £  Dexp(C/),  for  a  datatype  URIref  it; 

3.  enumerated  expressions  oneOf  (h, . . . ,  l„)  £  Dexp(C/),  for  literals  Zi , . . . ,  ln\ 

4.  conjunctive  expressions  and (Ei,...,En)  £  Dexp(C/),  for  unary  datatype  expressions 
-Ei,  E„  £  Dexp(<5); 

5.  disjunctive  expressions  or(Ei, ...,  En)  £  Dexp(d),  for  unaty  datatype  expressions 

Ex, ...,  E„  G  Dexp(<5).  o 


Example  5.  (/-unary  datatype  expressions  can  be  used  to  represent  XML  Schema  non¬ 
list  simple  types.  Given  the  unary  datatype  group  Qi  presented  in  Example  4  (page  3), 

-  the  following  XML  Schema  derived  union  simple  type 

<simpleType  name  =  “cameraPrice”> 

<union> 

<simpleType> 

^restriction  base  =  “xsd:nonNegativeInteger”> 

<maxExclusive  value  =  “100000” /> 

</restriction> 

</simpleType> 

<simpleType> 

<restriction  base  =  “xsd : string” > 

<enumeration  value  =  “low”/> 

<enumeration  value  =  “medium”/> 

<enumeration  value  =  “expensive”/> 

</restriction> 

</simpleType> 

</union> 

<simpleType> 

can  be  represented  by  the  following  disjunctive  expression 
or( 

and(xsd:nonNegativeInteger,  xsdx:integerLessThanl00000) 
o n eO  f ( “low”  "xsd :  string,  “medium”  "xsd :  string,  “expensive”  "xsd :  string) 

)• 

Note  that  “low” "xsd: string  is  a  typed  literal,  which  represents  a  value  of  the 
xsd :  string  datatype,  “low”,  instead,  is  a  plain  literal,  where  no  datatype  informa¬ 
tion  is  provided.  <C> 


We  now  define  the  interpretation  of  a  unary  datatype  group. 
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Abstract  Syntax 

DL  Syntax 

Semantics 

a  datatype  URIref  u 

u 

uu 

oneOf(Zi, . . . ,  ln) 

{7l  j  ,  In} 

{/nu...u{z°} 

not(u) 

U 

(dom(u))°  \  uu  if  u  £  S  \  B 
zAd  \  vP  otherwise 

and(i?i, .  . .  ,  En) 

Ei  A  ...  A  En 

or  (P,Q) 

EiV.-.VEn 

E?  u . . .  u  EP 

Table  1.  Syntax  and  semantics  of  datatype  expressions  (OWL-Eu  data  ranges) 


Definition  3  A  datatype  interpretation  Ir>  of  a  unary  datatype  group  Q  = 
(Md,B,dom)  is  a  pair  (zAd.-d),  where  ZAd  (the  datatype  domain)  is  a  non-empty 
set  and  •D  is  a  datatype  interpretation  function,  which  has  to  satisfy  the  following  con¬ 
ditions: 

1.  (rdfs: Literal)0  =  ZAd  and  (owlx:DatatypeBottom)°  =  0; 

2.  for  each  plain  literal  l,  P  =  l  £  PL  and  PL  C  ZAd  (PL  is  the  value  space  for  plain 
literals); 

3.  for  any  two  primitive  base  datatype  URIrefs  Ui,U2  £  B:  uP  Cl  u°  =  0; 

4.  for  each  supported  datatype  URIref  u  £  S.  where  d  =  Md(w): 

(a)  vP  =  V{d)  C  zAd.  L(u)  C  Z,(dom(u))  and  L2V ( u )  C  L2V (dom(-u)); 

(b)  if  s  £  L{d),  then  (“s”“w)D  =  L2V (d)(s);  otherwise,  (“s”“u)°  is  not  defined; 

5.  Vm  ^  S,  u°  C  zAd,  and  “v”“w  £  vP . 

Moreover,  we  extend  -D  to  Q  unary  datatype  expression  as  shown  in  Table  5 
(page  8).  Let  E  be  a  Q  unary  datatype  expression,  the  negation  of  E  is  of  the  form 
which  is  interpreted  as  zAd  \  E D.  o 

Next,  we  introduce  the  kind  of  basic  reasoning  mechanisms  required  for  a  unary 
datatype  group. 

Definition  4  Let  V  be  a  set  of  variables,  Q  —  (M^,  B,  dom)  a  unary  datatype  group 
and  u  £  B  a  primitive  base  datatype  URIref.  A  datatype  conjunction  of  u  is  of  the  form 

k  l 

^  =  /\uj{vj)  A /\  (1) 

j=l  i= 1 

where  the  v3  are  variables  from  V,  v[l\  v:^  are  variables  in  Ay=i  uj  {vj ),  Uj  are 
datatype  URI  references  from  S  such  that  dom(ttj)  =  u,  and  are  the  inequality 
predicates  for  primitive  base  datatypes  M,i(dom(iti))  where  u.:  appear  in  Ay=i  uj(vj)- 
A  datatype  conjunction  P  is  called  satisfiable  iff  there  exists  an  interpretation 
(zAd,  -d)  of  Q  and  a  function  S  mapping  the  variables  in  P  to  data  values  in  zAd  s.t. 
S(vj)  £  uP  (for  all  1  <  j  <  k)  and  {5(v^),  <5(t4^)}  Q  UF  and  <5(^1*^)  A  ^(v2^)  P°r 
all  1  <  i  <  l).  Such  a  function  S  is  called  a  solution  for  P  w.r.t.  (ZAd,  •d), 

We  end  this  section  by  elaborating  the  conditions  that  computable  unary  datatype 
groups  require. 
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Definition  5  A  unary  datatype  group  Q  is  conforming  iff 

1.  for  any  u  £  S  \  B:  there  exists  u'  €  S  \  B  such  that  u,D  =  vP,  and 

2.  for  each  primitive  base  datatype  in  Q ,  the  satisfiability  problems  for  finite  datatype 

conjunctions  of  the  form  (1)  is  decidable.  o 

4  OWL-Eu 

In  this  section,  we  present  a  small  extension  of  OWL  DL,  i.e.,  OWL-Eu.  The  underpin¬ 
ning  DL  of  OWL-Eu  is  SEiOTM  (fG  f),  i.e.,  the  S'HOJAf  DL  combined  with  a  unary 
datatype  group  Q  (1  for  unary).  Specifically,  OWL-Eu  (only)  extends  OWL  data  range 
(i.e.,  enumerated  datatypes  as  well  as  some  built-in  XML  Schema  datatypes)  to  OWL- 
Eu  data  ranges  defined  as  follows. 

Definition  6  An  OWL-Eu  data  range  is  a  Q  unary  datatype  expression.  Abstract  (as 
well  as  DL)  syntax  and  model-theoretic  semantics  of  OWL-Eu  data  ranges  are  presented 
in  Table  5  (page  8).  o 

The  consequence  of  the  extension  is  that  customised  datatypes,  represented  by 
OWL-Eu  data  ranges,  can  be  used  in  datatype  exists  restrictions  (3 T.u)  and  datatype 
value  restrictions  (VT.u),  where  T  is  a  datatype  property  and  u  is  an  OWL-Eu  data 
range.  Hence,  this  extension  of  OWL  DL  is  as  large  as  is  necessary  to  support  cus¬ 
tomised  datatypes. 

Example  6.  PCs  with  memory  size  greater  than  or  equal  to  512  Mb  and  with  price 
cheaper  than  700  pounds  can  be  represented  in  the  following  OWL-Eu  concept  descrip¬ 
tion  in  DL  syntax  (cf.  Table  5  on  page  8): 

PC  13  3memorySizeInMb.<5V2  13  BpricelnPound.  <700, 

where  <512  is  a  relativised  negated  expression  and  <700  is  a  supported  datatype  in  Q\ . 
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It  turns  out  that  OWL-Eu  (i.e.,  the  S'HOIN (Q  x )  DL)  is  decidable. 

Theorem  1.  The  SHOIM  {G  fj-concept  satisfiability  problem  w.r.t.  a  knowledge  base 
is  decidable  if  the  combined  unary  datatype  group  is  conforming. 

Proof:  (Sketch)  We  will  show  the  decidability  of  STiOTAf  (G  f) -concept  satisfiability 
w.r.t.  TBoxes  and  RBoxes  by  reducing  it  to  the  SHOXAT-concept  satisfiability  w.r.t. 
TBoxes  and  RBoxes.  The  basic  idea  behind  the  reduction  is  that  we  can  replace  each 
datatype  group-based  concept  C  in  T  with  a  new  atomic  primitive  concept  Ac  in  T’ . 
We  then  compute  the  satisfiability  problem  for  all  possible  conjunctions  of  datatype 
group-based  concepts  (and  their  negations)  in  T  (of  which  there  are  only  a  finite 
number),  and  in  case  a  conjunction  Cj  13  ...  13  Cn  is  unsatisfiable,  we  add  an  axiom 
Ac±  13  ...  13  Acn  E  J-  to  TC  For  example,  unary  datatype  group-based  concepts 
3 T.  >1  and  VT.  <0  occurring  in  T  would  be  replaced  with  .4 377 > ,  and  ,4VT.<rj  in 
T' ,  and  A-^t.>i  n  A\jt.<0  E  J-  would  be  added  to  T'  because  3T.  >1  13  VT.  <0  is 
unsatisfiable  (i.e.,  there  is  no  solution  for  the  predicate  conjunction  >1  0)  A  <0  0)). 
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5  OWL-E:  A  Step  Further 


In  this  section,  we  present  a  further  extension  of  OWL-Eu,  called  OWL-E,  which  sup¬ 
ports  not  only  customised  datatypes,  but  also  customised  datatype  predicates. 

A  datatype  predicate  (or  simply  predicate)  p  is  characterised  by  an  arity  a(p),  or 
a  minimum  arity  amin(jp)  if  p  can  have  multiple  arities,  and  a  predicate  extension  (or 
simply  extension)  E(p).  The  notion  of  predicate  maps  can  be  defined  in  an  obvious 
way.  For  example,  =mt  is  a  (binary)  predicate  with  arity  a(=lnt)  =  2  and  extension 
E(=mt )  =  {(ii,  if)  £  U(integer)2  |  ii  =  i2},  where  U(integer)  is  the  value  space 
for  the  datatype  integer. 

Now  we  can  generalise  unary  datatype  groups  by  the  definition  of  datatype  groups. 
In  fact,  datatypes  and  datatype  predicates  can  be  unified  in  datatype  groups.  Roughly 
speaking,  a  datatype  group  is  a  group  of  built-in  predicate  URIrefs  ‘wrapped’  around  a 
set  of  primitive  datatype  URIrefs.  A  datatype  group  Q  is  a  tuple  (Mp,B,dom),  where 
M/,  is  the  predicate  map  of  (/,  B  is  the  set  of  primitive  datatype  URI  references  in  Q 
and  dom  is  the  declared  domain  function.  We  call  S  the  set  of  built-in  predicate  URI 
references  of  Q,  i.e.,  for  each  u  £  S,  Mp(it)  is  defined;  we  require  BCS.  The  declared 
domain  function  dom  has  the  following  properties:  for  each  u  £  S, 


'  u 

(vi,. .  .,vn),  where  vi, . . . ,  vn  £  B 


dom(ii)  =  < 


{(v, . ... ,  v)  |  *  >  n},  where  »eB 


i  times 


if  u  €  B, 
if  u  £  S  \  B  and 
«(Mp(u))  =  n, 
if  u  £  S  \  B  and 

CLmin  (Mp(w))  =  n. 


Example  7.  Q2  =  (MP2,  B2,  dom2)  is  a  datatype  group,  where 

-  MP2  =  {xsd:integer  integer,  xsd:string  string,  xsd:integerGreaterThanOr- 

EqualToN  >n,  xsdx:integerLessThanN  <n,  xsdx:  integer  Equality  1— >!  =lnt}, 

-  B2  =  {xsd:string,  xsd:integer},  and 

-  dom2  =  {xsd: integer  xsd: integer,  xsd: string  1— >  xsd: string,  xsd : integerGreater- 

ThanOrEqualToN  1— >  xsd: integer,  xsdx: integer LessThanN  1— ►  xsd: integer, 
xsdx:integerEquality  (xsd: integer, xsd: integer)}. 


According  to  MP2,  we  have  S2  =  {xsd: integer, xsd: string, xsd: nonNega- 
tivelnteger,  xsdx :integerLessThanN, xsdx: integerEquality},  hence  B2  C  S2.  <> 


Furthermore,  based  on  datatype  groups,  we  can  extend  unary  datatype  expres¬ 
sions  to  general  (n-ary)  datatype  expressions.  While  enumerated  expressions  remain 
the  same,  relativised  negated,  conjunctive  and  disjunctive  unary  datatype  expressions 
can  be  easily  extended  to  the  n-nary  case.  There  is  a  new  kind  of  datatype  expres¬ 
sion  called  domain  expression :  domain(iti, . . .  ,un).  where  Ui  is  either  rdfs:Literal 
or  supported  unary  datatype  predicate  URIrefs,  or  their  relativised  negations.  For  ex¬ 
ample,  the  customised  predicate  ‘sumNoGreaterThanOrEqualTol5’,  with  extension 
E(sumNoGreatei'ThanOrEqualTol5 )  =  {(to, *i, «2, *3)  £  U(integer)4  |  * 0  = 


7 


Abstract  Syntax 

DL  Syntax 

Semantics 

not(u) 

U 

Ad  \uu  if  u  £  B 

(dom(it))D  \  vP  if  u  £  S  \  B 
Un>1(AD)"  \  uD  otherwise 

domainfui,  ■  •  • ,  un) 

\U\ ,  .  .  .  ,  V>n\ 

X  ...  X  Un 

Table  2.  Syntax  and  semantics  of  (new)  datatype  expressions 


i\  +  «2  +  *3  and  ->(i0  >  15)}  and  arity  a(sumN oGreaterThanOrEqualTolb)  =  4, 
can  be  represented  by 


xsdx:  integer  Addition  A 

[  xsdx:integerGreaterThanOrEqualTol5,  xsd:integer,  xsd:integer,  xsd:integer  ], 


which  is  a  conjunctive  expression,  where  the  first  conjunct  is  a  predicate  URIref  (that 
represents  +mt)  and  the  second  conjunct  is  a  domain  expression. 

We  can  extend  the  datatype  interpretation  2d  presented  in  Definition  3  to  give  se¬ 
mantics  to  datatype  groups.  For  each  u  £  S,  vP  =  -E(Mp(w))  C  (dom(u))D,  where 
(dom(M))D  is  defined  as  follows:  if  dom(u)  =  (d1; . . . ,  dn)  and  a(Mp(w))  =  n,  then 
(dom(u))D  =  dP  x  ...  x  d Jp;  if  dom(u)  =  {(d, . . .  ,d)  |  i  >  n}  and  amj„(Mp(w))  = 


i  times 

n,  then  (dom(it))D  =  (J,>  (dD)*.  The  abstract  syntax,  DL  syntax  and  semantics  of 
relativised  negated  and  domain  expressions  are  presented  in  Table  5. 

The  following  definition  summarises  the  conditions  that  computable  datatype 
groups  require. 


Definition  7  (Conforming  Datatype  Group)  A  datatype  group  Q  is  conforming  iff 

1.  for  any  u  £  S  \  B  with  a(Mp(tt))  =  n  >  2:  dom(w)  =  (w, . . .  ,w)  for  some 

n  times 

w  G  B,  and 

2.  for  any  u  G  S  \  B:  there  exist  v!  G  S  \  B  such  that  u,D  =  vP,  and 

3.  the  satisfiability  problem  for  finite  negation-free  predicate  conjunctions  is  decid¬ 
able,  and 

4.  for  each  primitive  datatype  URIref  u,;  £  B,  there  exists  wi  £  S,  s.t.  Mp  (Wi)  =^Ui 

where  is  the  binary  inequality  predicate  for  Mp(iq).  o 


Finally,  OWL-E  extends  OWL-Eu  with  the  datatype  group-related  class  construc¬ 
tors  presented  in  Table  3. 

Example  8.  (OWL-E  classes) 

Assume  that  electronic-shops  want  to  define  small  items  as  items  of  which  the  sum 
of  height,  length  and  width  is  no  greater  than  or  equal  to  15cm.  The  Smallltem  class 
can  be  represented  by  the  following  datatype  group-based  concept  description: 


3 Ts,Th,Ti,Tw.(+XTlt  A  [>15 ,  integer,  integer,  integer]), 


New  Element 

DL  Syntax 

Semantics 

expressive  predicate 
exists  restriction 

3Ti,.. 

■  ,Tn.E 

{x  £  A1  \  3H, . .  .,tn.{x,ti)  £  Tx  (for  all 

1  <  i  <  m)  A  (tt, . . .  ,t„)  £  ED} 

expressive  predicate 
value  restriction 

VTi... 

■  ,Tn.E 

{x  £  A1  \  Vti, . . .  ,t„.(x,ti)  £  Tx  (for  all 

1  <  i  <  m)  — >  (ti, . . . ,  t„)  €  ED} 

expressive  predicate 
atleast  restriction 

. . .  ,Tn.E 

{x  £  A1  jt{(tr, . . . ,  t„)  |  (x,ti)  €  Tx  (for  all 

1  <  *  <  m)  A  (ti, . . . ,  t„)  €  i?D}  >  m} 

expressive  predicate 
atmost  restriction 

/A 

§ 

. . ,  Tn.E 

{x  G  A1  |  ${(ti,...,tn)  |  (x,U)  £  Tx  (for  all 

1  <  i  <  m)  A  (ti, ,  ■  ■ ,  tn)  €  J3d}  <  m} 

Table  3.  New  class  constructors  in  OWL-E 


where  Ts ,  T)( .  T[ ,  Tw  are  concrete  roles  representing  “sum  in  cm”,  “hight 
in  cm”,  “length  in  cm”  and  “width  in  cm”,  respectively,  and  (+mt  a 
[>15>  integer,  integer,  integer ])  is  a  conjunctive  datatype  expression  representing  the 
customised  predicate  “sum  no  larger  than  or  equal  to  15”. 1  <C> 

Like  OWL-Eu,  OWL-E  (i.e.,  the  S'HOTQ(G)  DL)  is  also  a  decidable  extension  of 
OWL-DL. 

Theorem  2.  The  STLOTff '(G)-  and  STLOTQ{Q)-concept  satisfiability  and  subsump¬ 
tion  problems  w.r.t.  TBoxes  and  RBoxes  are  decidable. 

According  to  Tobies  [13,  Lemma  5.3],  if  C  is  a  DL  that  provides  the  nominal  con¬ 
structor,  knowledge  base  satisfiability  can  be  polynomially  reduced  to  satisfiability  of 
TBoxes  and  RBoxes.  Hence,  we  obtain  the  following  theorem. 

Theorem  3.  The  knowledge  base  satisfiability  problems  of  STtOIAf{G)  and 
SHOTQ(G)  are  decidable. 

6  Conclusion 

In  this  paper,  we  propose  OWL-Eu  and  OWL-E,  two  decidable  extensions  of  OWL 
DL  that  support  customised  datatypes  and  customised  datatype  predicates.  OWL-Eu 
provides  a  general  framework  for  integrating  OWL  DL  with  customised  datatypes,  such 
as  XML  Schema  non-list  simple  types.  OWL-E  further  extends  OWL-Eu  to  support 
customised  datatype  predicates. 

We  have  implemented  a  prototype  extension  of  the  FaCT  [5]  DL  system,  called 
FaCT-DG,  to  support  TBox  reasoning  in  both  OWL-Eu  and  OWL-E  (without  nomi- 
nals).  As  for  future  work,  we  are  planning  to  extend  the  DIG1.1  interface  [3]  to  sup¬ 
port  OWL-Eu,  and  to  implement  a  Protege  [6]  plug-in  to  support  XML  Schema  non-list 
simple  types,  i.e.  users  should  be  able  to  define  and/or  import  customised  XML  Schema 
non-list  simple  types  based  on  a  set  of  supported  datatypes,  and  to  exploit  our  prototype 
through  the  extended  DIG  interface.  Furthermore,  we  plan  to  extend  the  FaCT++  DL 
reasoner  [4]  to  support  the  full  OWL-Eu  and  OWL-E  ontology  languages. 

1  To  save  space,  we  use  predicates  instead  of  predicate  URIrefs  here. 
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The  Irresistible  S1ZXQ 
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School  of  Computer  Science,  The  University  of  Manchester, 
Kilburn  Building,  Oxford  Road,  Manchester,  M13  9PL,  UK, 
{Horrocks,  Kutz,  Sattler}@cs .man. ac .uk 


Abstract.  Motivated  primarily  by  medical  terminology  applications, 
the  prominent  DL  STiTQ  has  already  been  extended  to  a  DL  with  com¬ 
plex  role  inclusion  axioms  of  the  form  R  o  S  □  R  or  S  o  R  t  _R,  called 
7 ZTQ,  and  the  STCIQ  tableau  algorithm  has  been  extended  to  handle 
such  inclusions. 

This  paper  further  extends  TZTQ  and  its  tableau  algorithm  with  im¬ 
portant  expressive  means  that  are  frequently  requested  in  ontology  ap¬ 
plications,  namely  with  reflexive,  symmetric,  transitive,  and  irreflexive 
roles,  disjoint  roles,  and  the  construct  Bit. Self,  allowing,  for  instance,  the 
definition  of  concepts  such  as  a  “narcist” .  Furthermore,  we  extend  the  al¬ 
gorithm  to  cover  Abox  reasoning  extended  with  negated  role  assertions. 
The  resulting  logic  is  called  STITQ. 


1  Introduction 

We  describe  an  extension,  called  STZIQ,  of  the  description  logic  (DL)  SHIM 
(10)  underlying  OWL  lite  and  OWL  DL  (7).  We  believe  that  STZIQ  enjoys 
some  useful  properties.  Firstly,  SIZIQ  extends  SHIM  with  numerous  expres¬ 
sive  means  which  have  been  asked  for  by  users,  and  which,  we  believe,  will  make 
modeling  using  DLs  easier  and  more  intuitive.  While  the  language  of  SIZIQ  is 
designed  to  be  slightly  redundant  in  the  sense  that  some  of  the  new  expressive 
means  can  be  simulated  by  others,  the  complete  absence  of  those  expressive 
means  has  proven  quite  harmful  since  developers  of  ontologies  use  work-arounds 
to  compensate  for  this.  As  a  consequence,  ontologies  become  cluttered,  com¬ 
plicated,  and  difficult  to  understand.  In  the  worst  case,  the  work-around  only 
partially  captures  the  intended  semantics,  thus  leading  to  unintended  or  missing 
consequences,  thereby  destroying  one  of  the  main  features  of  a  logic-based  for¬ 
malism,  namely  its  well-defined  semantics  and  reasoning  services.  A  well-known 
example  are  qualified  number  restrictions.  Their  absence  in  OWL  lite  and  OWL 
DL  has  caused  problems  in  the  past  (12),  and  has  led  to  the  development  and  use 
of  questionable  surrogates.  Hence,  STZIQ  provides  qualified  number  restrictions. 
Other,  novel  expressive  means  of  STZIQ  concern  mostly  roles  and  include: 

—  disjoint  roles.  E.g.,  the  roles  sister  and  mother  could  be  declared  as  being 
disjoint.  Most  DLs  can  be  said  to  be  “lopsided”  since  they  allow  to  express 
disjointness  on  concepts  but  not  on  roles,  despite  the  fact  that  role  disjoint¬ 
ness  is  quite  natural  and  can  generate  new  subsumptions  or  inconsistencies 
in  the  presence  of  role  hierarchies  and  number  restrictions. 
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—  reflexive  and  irreflexive  roles.  E.g.,  the  role  knows  could  be  declared  as  being 
reflexive,  and  the  role  sibling  could  be  declared  as  being  irreflexive.  In  the 
presence  of  the  new  concept  3R. Self  described  below,  reflexive  and  irreflexive 
roles  also  become  definable  by  Tbox  assertions. 

—  negated  role  assertions.  Most  Abox  formalisms  only  allow  for  positive  role 
assertions  (with  few  exceptions  (1;  5)),  whereas  S1Z1Q  also  allows  for  state¬ 
ments  such  as  (John,  Mary)  :  ^likes.  In  the  presence  of  complex  role  inclu¬ 
sions,  negated  role  assertions  can  be  quite  useful  and,  like  disjoint  roles,  they 
overcome  a  certain  “lopsidedness”  of  DLs. 

—  Since  S1ZIQ  extends  SH1Q ,  we  can  also  express  that  a  role  is  transitive  or 
symmetric,  and  can  use  role  inclusion  axioms 

—  Since  S1ZXQ  extends  1Z1Q  (8),  we  can  use  complex  role  inclusion  axioms 
of  the  form  R  o  S  C  R  and  S  o  R  C  R.  For  example,  w.r.t.  the  axiom 
ownsohasPart  t  owns,  and  the  fact  that  each  car  contains  an  engine  Car  £ 
dhasPart. Engine,  an  owner  of  a  car  is  also  an  owner  of  an  engine,  i.e.,  the 
following  subsumption  is  implied:  downs. Car  C  downs. Engine. 

—  Finally,  S1Z1Q  allows  for  concepts  of  the  form  31?. Self  which  can  be  used  to 
express  “local  reflexivity”  of  a  role  R,  e.g.,  to  define  the  concept  “narcist” 
using  dlikes.Self. 

Besides  a  Tbox  and  an  Abox,  S1Z1Q  provides  a  so-called  Rbox  to  gather  all 
statements  concerning  roles. 

Secondly,  S1Z1Q  is  designed  to  be  of  similar  practicability  as  SHIQ.  The 
tableau  algorithm  for  SH1Q  and  the  one  for  S1Z1Q  presented  here  are  very 
similar.  Even  though  the  additional  expressive  means  of  S1ZIQ  require  certain 
adjustments  to  the  SH1Q  algorithm,  these  adjustments  do  not  add  new  sources 
of  non-determinism,  and,  subject  to  empirical  verification,  are  believed  to  be 
“harmless”  in  the  sense  of  not  significantly  degrading  typical  performance  as 
compared  with  the  SHIQ  algorithm.  More  precisely,  we  employ  the  same  tech¬ 
nique  using  finite  automata  as  in  (8)  to  handle  role  inclusions  R  o  S  t  R  and 
S  o  R\Z  R.  This  involves  a  pre-processing  step  which  takes  an  Rbox  and  builds, 
for  each  role  R,  a  finite  automaton  that  accepts  exactly  those  words  Ri . . .  Rn 
such  that,  in  each  model  of  the  Rbox,  (x,  y)  €  (i?i . . .  Rn)X  implies  ( x ,  y)  €  R1. 
These  automata  are  then  used  in  the  tableau  expansion  rules  to  check,  for  a 
node  x  with  \/R.C  £  C(x)  and  an  R\ . . .  i?n-neighbour  y  of  x,  whether  to  add  C 
to  Even  though  the  pre-processing  step  might  appear  a  little  cumbersome, 
the  usage  of  the  automata  in  the  algorithm  makes  it  quite  elegant  and  compact. 

The  current  paper  describes  work  in  progress  towards  a  description  logic 
that  overcomes  certain  shortcomings  in  expressiveness  of  other  DLs.  We  have 
used  SHIM,  SHIQ ,  and  1ZIQ  as  a  starting  point,  extended  them  with  some 
“useful- yet-harmless”  expressive  means,  and  also  extended  the  tableau  algorithm 
accordingly.  We  wish  to  discuss  this  extension  in  case  we  have  overlooked  other 
“useful- yet-harmless”  expressive  means,  and  we  plan  to  further  extend  S1ZIQ: 
currently,  various  new  operators  are  restricted  to  simple  roles,  and  we  have  yet 
to  establish  which  of  these  restrictions  are  necessary  in  order  to  preserve  decid- 
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ability1  or  practicability.  Moreover,  we  plan  to  extend  S1Z1Q  towards  SHOIQ 
(9),  i.e.,  to  also  include  nominals. 

For  a  full  specification  of  the  tableau  algorithm  and  proofs,  see  (6). 

2  The  Logic  S1ZXQ 

In  this  section,  we  introduce  the  DL  S1Z1Q.  This  includes  the  definition  of 
syntax,  semantics,  and  inference  problems. 

2.1  Roles,  Role  Hierarchies,  and  Role  Assertions 

Definition  1  (Interpretations).  Let  C  be  a  set  of  concept  names,  R  a  set 
of  role  names,  and  I  =  {a,  b,  c...}  a  set  of  individual  names.  The  set  of 
roles  is  RU  {f?-  j  R  £  R},  where  a  role  Rr  is  called  the  inverse  role  of  R. 

As  usual,  an  interpretation  T  =  (A1,-1')  consists  of  a  set  A1 ,  called  the 
domain  ofT,  and  a  valuation  -x  which  associates,  with  each  role  name  R,  a 
binary  relation  R1  C  Ax  x  Ax ,  with  each  concept  name  C  a  subset  Cx  C  Ax  and, 
with  each  individual  name  a  an  element  ax  £  Ax .  Inverse  roles  are  interpreted 
as  usual,  i.e.,  for  each  role  R  £  R,  we  have 

(R-)1  =  {( y,x )  |  (x,y)  £  R1}. 

Note  that,  unlike  in  the  case  of  STLIQ ,  we  did  not  introduce  transitive  role 
names.  This  is  so  since,  as  will  become  apparent  below,  role  box  assertions  can 
be  used  to  force  roles  to  be  transitive. 

To  avoid  considering  roles  such  as  R  ,  we  define  a  function  Inv  on  roles 
such  that  Inv(R)  =  R~  if  R  £  R  is  a  role  name,  and  lnv(i?)  =  S  £  R  if  R  =  S~ . 

Since  we  will  often  work  with  a  string  of  roles,  it  is  convenient  to  extend 
both  •x  and  Inv(-)  to  such  strings:  if  w  =  Ri . . .  Rn  for  It,  roles,  then  we  set 
wx  =  Rx o. .  ,o Rx  and  Inv(ru)  =  lnv(f?„) . . .  Inv(i?i),  where  o  denotes  composition 
of  binary  relations. 

A  role  box  IZ  consists  of  two  components.  The  first  component  is  a  role  hier¬ 
archy  7 Zh  which  consists  of  (generalised)  role  inclusion  axioms,  i.e.,  statements 
of  the  form  R  t  S,  RS  t  S,  and  SR  t  S.  The  second  component  is  a  set 
IZa  of  role  assertions  stating,  for  instance,  that  a  role  R  must  be  interpreted  as 
a  transitive,  reflexive,  irreflexive,  symmetric,  or  transitive  relation,  or  that  two 
(possibly  inverse)  roles  R  and  S  are  to  be  interpreted  as  disjoint  binary  relations. 

We  start  with  the  definition  of  a  role  hierarchy,  whose  definition  involves 
a  strict  partial  order  -<  on  roles,  i.e.,  an  irreflexive  and  transitive  relation  on 
R  U  {R~  |  R  £  R}. 

Definition  2  ((Regular)  Role  Inclusion  Axioms). 

Let  -<  be  a  strict  partial  order  on  roles.  A  role  inclusion  axiom  (RIA  for 
short)  is  an  expression  of  the  form  w  C  R,  where  w  is  a  finite  string  of  roles, 
and  R  is  a  role  name.  A  role  hierarchy  IZh,  then,  is  a  finite  set  of  RIAs. 

1  See  (10)  for  such  a  case. 
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An  interpretation  X  satisfies  a  role  inclusion  axiom  Si . . .  Sn  E  R,  if 

Sf  O  .  .  .  o  SX  C  If, 

where  o  stands  for  the  composition  of  binary  relations.  An  interpretation  is  a 
model  of  a  role  hierarchy  Rh,  if  it  satisfies  all  RIAs  in  Rh,  written  X  |=  Rh- 
A  RIA  w  G  R  is  -<-regular  if 

—  R  is  a  role  name, 

—  w  =  RR, 

—  w  =  R~, 

—  w  =  Si ...  Sn  and  Si  -<  R,  for  all  \  <  i  <  n, 

—  w  =  RS\ . . .  Sn  and  Si  -<  R,  for  all  1  <  i  <  n,  or 

—  w  =  Si . . .  SnR  and  Si  ~<  R,  for  all  1  <  i  <  n. 

Finally,  a  role  hierarchy  IZh  is  said  to  be  regular  if  there  exists  a  strict  partial 
order  -<  on  roles  such  that  each  RIA  in  IZh  is  ~<-regular. 

Regularity  prevents  a  role  hierarchy  from  containing  cyclic  dependencies.  For 
instance,  the  role  hierarchy 

{RS  E  S',  RTt  R,  UT  CT,  US  E  U} 

is  not  regular  because  it  would  require  -<  to  satisfy  S^U^,T^R^,S,  which 
would  imply  S  -<  S,  thus  contradicting  irrcflexivity.  Such  cyclic  dependencies 
are  known  to  lead  to  undecidability  (8). 

From  the  definition  of  the  semantics  of  inverse  roles,  it  follows  immediately 
that 

(x,y)  £  ui1  iff  (y,x)  £  lnv('u;);r. 

Hence,  each  model  satisfying  w  C  S  also  satisfies  Inv(ie)  C  Inv(S')  (and  vice 
versa),  and  thus  the  restriction  to  those  RIAs  with  role  names  on  their  right 
hand  side  does  not  have  any  effect  on  expressivity. 

Given  a  role  hierarchy  IZh ,  we  define  the  relation  E  to  be  the  transitive- 
reflexive  closure  of  t  over  {R  t  S,  lnv(i?)  t  Inv(S)  |  R  t  S  £  7^/j}.  A  role  R 
is  called  a  sub-role  (resp.  super-role)  of  a  role  S  if  R  E  S  (resp.  S  E  Ri) .  Two 
roles  R  and  S  are  equivalent  ( R=S )  if  R  E  S  and  S  E  R. 

Note  that,  due  to  the  fourth  restriction  in  the  definition  of  ^-regularity,  we 
also  restrict  E  to  be  acyclic,  and  thus  regular  role  hierarchies  never  contain  two 
equivalent  roles.2 

Next,  let  us  turn  to  the  second  component  of  Rboxes,  the  role  assertions.  For 
an  interpretation  X,  we  define  Diag 1  to  be  the  set  {(x,  x)  j  x  £  A1}  and  set 
R1  |:=  {(x,x)  |  3 y  £  Ax.(x,y)  £  R1}. 

2  This  is  not  a  serious  restriction  for,  if  IZ  contains  E  cycles,  we  can  simply  choose 
one  role  R  from  each  cycle  and  replace  all  other  roles  in  this  cycle  with  R  in  the 
input  Rbox,  Tbox  and  Abox  (see  below). 
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Definition  3  (Role  Assertions).  For  roles  R  and  S,  we  call  the  assertions 
Ref(f?),  Irr(R),  Sym(7?),  Tra(7?),  and  Dis (R,S),  role  assertions,  where,  for 
each  interpretation  X  and  all  x,  y,  z  £  Ax ,  we  have: 

X  |=  Sym(7?)  if  (x,y)  £  R1  implies  (y,x)  £  R1', 

X  \=  Tra  (7?)  if  (x,y)  £  R1  and  ( y,z )  £  R1  imply  (x,z)  £  R1', 

X  |=  Ref(i?)  if  Rx  |C  Rx; 

X  |=  Irr(R)  if  Rx  n  Diag1  =  0; 

X  |=  D\s(R,S)  if  RxnSx  =  (t ). 


Adding  symmetric  and  transitive  role  assertions  is  a  trivial  move  since  both 
of  these  expressive  means  can  be  replaced  by  complex  role  inclusion  axioms  as 
follows:  for  the  role  assertion  Sym(f?)  we  can  add  to  the  Rbox,  equivalently,  the 
role  inclusion  axiom  R~  C  R,  and,  for  the  role  assertion  Tra (R),  we  can  add 
to  the  Rbox,  equivalently,  RR  C  R.  The  proof  of  this  should  be  obvious.  Thus, 
as  far  as  expressivity  is  concerned,  we  can  assume  for  convenience  that  no  role 
assertions  of  the  form  Tra (R)  or  Sym(R)  appear  in  720 ,  but  that  transitive  and 
symmetric  roles  will  be  handled  by  the  RIAs  alone. 

The  situation  is  different,  however,  for  the  other  Rbox  assertions.  Neither 
reflexivity  nor  irreflexivity  nor  disjointness  of  roles  can  be  enforced  by  role  inclu¬ 
sion  axioms.  However,  as  we  shall  see  later,  reflexivity  and  irreflexivity  of  roles 
are  closely  related  to  the  new  concept  372. Self. 

In  SHIQ,  the  application  of  qualified  number  restrictions  has  to  be  restricted 
to  certain  roles,  called  simple  roles,  to  preserve  decidability  (10).  In  the  context  of 
STZXQ,  the  definition  of  simple  role  has  to  be  slightly  modified,  and  simple  roles 
figure  not  only  in  qualified  number  restrictions,  but  in  several  other  constructs  as 
well.  Intuitively,  non-simple  roles  are  those  that  are  implied  by  the  composition 
of  roles. 

Given  a  role  hierarchy  72/,  and  a  set  of  role  assertions  720  (without  transitivity 
or  symmetry  assertions) ,  the  set  of  roles  that  are  simple  in  72  =  72/,  U  72a  is 
inductively  defined  as  follows: 


—  a  role  name  is  simple  if  it  does  not  occur  on  the  right  hand  side  of  a  RIA  in 

72/,, 

—  an  inverse  role  R~  is  simple  if  R  is,  and 

—  if  R  occurs  on  the  right  hand  side  of  a  RIA  in  1Z /,,  then  R  is  simple  if,  for 
each  w  C  R  £  72/,,  w  =  S  for  a  simple  role  S. 


A  set  of  role  assertions  7Za  is  called  simple  if  all  roles  R,  S  appearing  in  role 
assertions  of  the  form  Ref(R),  lrr(f?),  or  Dis(I?,  S)  are  simple  in  1Z.  If  7 Z  is  clear 
from  the  context,  we  often  use  “simple”  instead  of  “simple  in  72.”. 

Definition  4  (Role  Box).  A  SIZIQ-role  box  (Rbox  for  short.)  is  a  set  72  = 
72;,  U  720,  where  72/,  is  a  regular  role  hierarchy  and  720  is  a  finite,  simple  set  of 
role  assertions. 

An  interpretation  satisfies  a  role  box  72  (written  X  \=  1Z)  if  X  \=  Rh  and 
X  \=  (f>  for  all  role  assertions  f>  £  Ra-  Such  an  interpretation  is  called  a  model  of 

72. 
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2.2  Concepts  and  Inference  Problems  for  SRIQ 

We  are  now  ready  to  define  the  syntax  and  semantics  of  <S7UI<2-concepts. 

Definition  5  (SRIQ  Concepts,  Tboxes,  and  Aboxes).  The  set  of  SRIQ- 
concepts  is  the  smallest  set  such  that 

—  every  concept  name  and  T,  _L  are  concepts,  and, 

—  if  C,  D  are  concepts,  R  is  a  role  (possibly  inverse),  S  is  a  simple  role  (possibly 

inverse),  and  n  is  a  non-negative  integer,  then  C  n  D,  C  U  D,  -> C ,  VR.C, 
3 R.C,  dS'.Self.  nS.C ),  and  nS.C )  are  also  concepts. 

A  general  concept  inclusion  axiom  (GCI)  is  an  expression  of  the  form 
C  t  D  for  two  SRI Q- concepts  C  and  D.  A  Tbox  T  is  a  finite  set  of  GCIs. 

An  individual  assertion  is  of  one  of  the  following  forms:  a  :  C,  (a,  b) :  R, 
( a,b )  :  -i S,  or  a  ft  b,  for  a,b  £  I  (the  set  of  individual  names),  a  (possibly 
inverse)  role  R,  a  (possibly  inverse)  simple  role  S,  and  a  SRIQ-concept  C.  A 
SRIQ-Abox  A  is  a  finite  set  of  individual  assertions. 

Note  that  number  restrictions  (f^nS.C)  and  nS.C ),  the  concept  3S.Self, 
and  negated  role  assertions  (a,  b)  :  ~^S,  are  all  restricted  to  simple  roles.  In  the 
case  of  number  restrictions  we  mentioned  the  reason  for  this  restriction  already: 
without  it,  already  the  satisfiability  problem  of  57YJQ-concepts  is  undecidable 
(10),  even  for  a  logic  without  inverse  roles  and  with  only  unqualifying  number 
restrictions  (these  are  number  restrictions  of  the  form  (jznR.T)  and  (^fnR. T)). 
For  SRIQ  and  the  remaining  restrictions  to  simple  roles  in  concept  expressions 
as  well  as  role  assertions,  it  is  part  of  future  work  to  determine  which  of  these 
restrictions  to  simple  roles  are  necessary  in  order  to  preserve  decidability  or 
practicability.  For  example,  it  should  be  possible  to  also  allow  non-simple  roles 
in  negated  role  assertions  (a,  b) :  ~^R  without  losing  decidability. 

Note  also  that,  in  the  definition  of  SRIQ- Aboxes,  we  do  not  assume  the 
unique  name  assumption  (UNA)  (which  is  commonly  assumed  in  DLs  (4)). 
Rather,  by  allowing  inequalities  between  individuals  in  the  Abox  to  be  explic¬ 
itly  stated,  we  increase  flexibility  while,  obviously,  the  UNA  can  be  regained  by 
explicitly  stating  a  ft  b  for  every  pair  a,b  £  I  of  individuals.  Moreover,  notice 
that,  in  contrast  to  standard  Aboxes,  SRIQ- Aboxes  can  also  contain  negated 
role  assertions  of  the  form  (a,  b) :  ~^R. 

Definition  6  (Semantics  and  Inference  Problems). 

Given  an  interpretation  I  =  (Ax, -x),  concepts  C,  D,  roles  R,  S,  and  non¬ 
negative  integers  n,  the  extension  of  complex  concepts  is  defined  inductively 
by  the  following  equations,  where  j \M  denotes  the  cardinality  of  a  set  M : 

Tx  =  Ax ,  _LX  =  0,  (->C')X  =  Ax  \  Cx  (top,  bottom,  negation) 

{C  n  D)x  =  Cx  n  Dx ,  (C  U  D)x  =  Cx  U  Dx  (conjunction,  disjunction) 
(3 R.C)X  =  {x  |  3 y.(x,y)  £  Rx  and  y  £  Cxj  (exists  restriction) 

(3f?.Self)x  =  {x  j  ( x,x )  £  R1}  (3R. SeM -concepts) 

(\/R.C)x  =  {x  |  \/y.(x,y)  £  Rx  implies  y  £  C1}  (value  restriction) 
(^nR.C)x  =  {x  |  jj {y.(x,y)  £  Rx  and  y  £  Cx}  ^  n}  (atleast.  restriction) 
{^nR.C)x  =  {x  |  §{y.(x,  y)  £  Rx  and  y  £  Cx}  ^  n}  (atmost  restriction) 
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An  interpretation  X  is  a  model  of  a  Tbox  X  (written  X  \=  X)  iff  C 1  C  Dx 
for  each  GCI  C  t  D  in  X . 

A  concept  C  is  called  satisfiable  iff  there  is  an  interpretation  X  with  Cx  /  0. 
A  concept  D  subsumes  a  concept  C  (written  C  C  D)  iff  C1  C  Dx  holds  for 
each  interpretation.  Two  concepts  are  equivalent  (written  C  =  D)  if  they  are 
mutually  subsuming.  The  above  inference  problems  can  be  defined  w.r.t.  a  general 
role  boxTZ  and/or  a  Tbox  X  in  the  usual  way,  i.e.,  by  replacing  interpretation 
with  model  of  TZ  and/or  T . 

For  an  interpretation  X,  an  element  x  G  Ax  is  called  an  instance  of  a 
concept  C  iff  x  G  Cx . 

An  interpretation  X  satisfies  (is  a  model  of)  an  Abox  A  (X  \=  A)  if  for  all 
individual  assertions  (f>  G  A  we  have  X  |=  cf>,  where 

X  \=  a:  C  if  a1  G  Cx\  X  \=  a  /  b  if  ax  /  6X; 

X  \=  ( a,b):R  if  (a1 ,  b x)  G  Rx;  X  |=  (a,  b) :  -> R  if  (a1 ,  bx)  Rx . 

An  Abox  A  is  consistent  with  respect  to  an  Rbox  1Z  and  a  Tbox  X  if  there 
is  a  model  X  for  7 Z  and  X  such  that  X  \=  A. 

For  DLs  that  are  closed  under  negation,  subsumption  and  (un) satisfiability 
of  concepts  can  be  mutually  reduced:  C  C  D  iff  C  l~1  ->D  is  unsatisfiable,  and  C 
is  unsatisfiable  iff  C  C  _L.  Furthermore,  a  concept  C  is  satisfiable  iff  the  Abox 
{a:C}  is  consistent. 

It  is  straightforward  to  extend  these  reductions  to  Rboxes  and  Tboxes.  In 
contrast,  the  reduction  of  inference  problems  w.r.t.  a  Tbox  to  pure  concept 
inference  problems  (possibly  w.r.t.  a  role  hierarchy),  deserves  special  care:  in 
(2;  11;  3),  the  internalisation  of  GCIs  is  introduced,  a  technique  that  realises 
exactly  this  reduction.  For  S7ZXQ,  this  technique  can  be  modified  accordingly. 

Now,  note  also  that,  instead  of  having  a  role  assertion  Ref  (I?)  G  7 Za,  we 
can  add,  equivalently,  the  GCI  37?. T  O  37?. Self  to  T,  which  can  in  turn  be 
internalised.  Likewise,  instead  of  asserting  lrr(7?),  we  can,  equivalently,  add  the 
GCI  T  O  -iBR. Self.  Thus,  we  arrive  at  the  following  theorem: 

Theorem  1.  1.  Satisfiability  and  subsumption  of  S7ZXQ- concepts  w.r.t.  Tboxes 

and  Rboxes  are  polynomially  reducible  to  (un) satisfiability  of  STZX Q- concepts 
w.r.t.  Rboxes. 

2.  Consistency  of  STZXQ-Aboxes  w.r.t.  Tboxes  and  Rboxes  is  polynomially  re¬ 
ducible  to  consistency  of  STZXQ-Aboxes  w.r.t.  Rboxes. 

3.  W.l.o.g.,  we  can  assume  that  Rboxes  do  not  contain  role  assertions  of  the 
form  lrr(7?),  Ref(7?),  Tra(7?),  or  Sym(7?). 

With  Theorem  1,  all  standard  inference  problems  for  57?XQ-concepts  and 
Aboxes  can  be  reduced  to  the  problem  of  determining  the  consistency  of  a 
STZXQ-Abox  w.r.t.  to  an  Rbox,  where  we  can  assume  w.l.o.g.  that  all  role  as¬ 
sertions  in  the  Rbox  are  of  the  form  Dis(7?,  S) — we  call  such  an  Rbox  reduced. 
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3  S1ZTQ  is  Decidable 

As  we  have  just  seen,  we  can  restrict  our  attention  to  the  consistency  of  Aboxes 
w.r.t.  reduced  Rboxes  only.  We  have  extended  the  tableau  algorithm  for  7ZTQ 
to  SIZTQ ,  and  will  spend  the  remainder  of  this  paper  on  its  description. 

In  a  first  step,  the  tableau  algorithm  takes  a  reduced  Rbox  TZ  and  an  Abox 
A  and  builds,  for  each  possibly  inverse  role  R  occurring  in  TZ  or  A,  a  non- 
deterministic  finite  automaton  Bn-  Intuitively,  such  an  automaton  is  used  to 
memorise  the  path  between  an  object  x  that  has  to  satisfy  a  concept  of  the  form 
\/R.C  and  other  objects,  and  then  to  determine  which  of  these  objects  must  sat¬ 
isfy  C.  The  following  proposition  states  that  Br  indeed  captures  all  implications 
between  (paths  of)  roles  and  R  that  are  consequences  of  the  role  hierarchy  TZh, 
where  L(Br)  denotes  the  language  (a  set  of  strings  of  roles)  accepted  by  Br. 

Proposition  1.  T  is  a  model  of  TZh  if  and  only  if,  for  each  (possibly  inverse) 
role  R  occurring  in  TZh,  each  word  w  £  L(Br),  and  each  ( x,  y )  £  wx ,  we  have 
{ x,y )  £  R1  ■ 

Since  Aboxes  usually  involve  several  individuals  with  arbitrary  role  relation¬ 
ships  between  them,  the  completion  algorithm  presented  works  on  forests  rather 
than  on  trees.  A  forest  is  a  collection  of  trees  whose  root  nodes  correspond  to 
the  individuals  appearing  in  the  input  Abox  and  which  form  an  arbitrarily  con¬ 
nected  graph  according  to  the  role  assertions  stated  in  the  Abox.  Similar  as  for 
TZIQ ,  we  define  a  set  fclos(A,7\l)  of  “relevant  sub-concepts”  of  those  concepts 
occurring  in  A;  see  (6)  for  details. 

Definition  7.  A  completion  forest  F  for  a  STZIQ-Abox  A  and  an  Rbox  7 Z  is 
a  collection  of  trees  whose  distinguished  root  nodes  can  be  connected  arbitrarily. 
Moreover,  each  node  x  is  labelled  with  a  set  L(x)  C  fclos(A,7?.)  and  each  edge 
(x,  y )  from  a  node  x  to  its  successor  y  is  labelled  with  a  non-empty  set  L({x,  y)) 
of  (possibly  inverse  and  possibly  negated)  roles  occurring  in  A  and  7 Z.  Finally, 
completion  forests  come  with  an  explicit  inequality  relation  fb  on  nodes  which 
is  implicitly  assumed  to  be  symmetric. 

Let  x  and  y  be  nodes  in  F  and  R  a  role.  If  R'  \AR.  and  R'  €  L((x,y)),  then 
■y  is  called  an  R-successor  of  x. 

If  y  is  an  R-successor  of  x  or  x  is  an  In \/(R)-successor  of  y,  then  y  is  called 
an  R-neighbour  of  x.  Moreover,  a  node  x  is  a  neighbour  of  y,  if  it  is  an 
R-neighbour  for  some  role  R.  Successors,  predecessors,  ancestors,  and  de¬ 
scendants  are  defined  as  usual. 

For  a  role  S,  a  concept  C ,  and  a  node  x  in  F,  we  define  SF(x,  C)  by 
SF(x,C)  :=  {y  |  y  is  an  S-neighbour  of  x  and  C  £  L(y)j. 

A  node  is  blocked  iff  it  is  either  directly  or  indirectly  blocked.  A  node  x  is 
directly  blocked  iff  none  of  its  ancestors  are  blocked,  and  it  has  ancestors  x' , 
■y  and  y'  such  that 

1.  none  of  x' ,  y  and  y'  is  a  root  node, 
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2.  x  is  a  successor  of  x'  and  y  is  a  successor  of  y'  and 

3.  £j(x)  =  L{y)  and  L(x')  =  £(t/)  and 
4-  &{{x',x))  =  L{(y',y)). 

In  this  case,  we  say  that  y  blocks  x. 

A  node  y  is  indirectly  blocked  if  one  of  its  ancestors  is  blocked. 

Given  a  non-empty  STZTQ-Abox  A  and  a  reduced  Rbox  TZ,  the  tableau  algo¬ 
rithm  is  initialised  with  the  completion  forest  F  a, TZ  defined  as  follows: 

—  for  each  individual  a  occurring  in  A,  F a,tz  contains  a  root  node  xa, 

—  if  (a,b) :  R  £  A  or  (a,  b) :  -<R  £  A,  then  F_4  -jz  contains  an  edge  (xa,  xf), 

—  if  a  fk  b  £  A,  then  xa  fk  Xh  is  in  Fjr, 

—  Z(xa)  :={C\a:C  £A],  and 

—  £((xa,  Xb))  :=  {R\  (a,  b):R£  A}  U  R  \  (a,  b) :  ->R  £  A}. 

A  completion  forest  F  is  said  to  contain  a  clash  if  there  are  nodes  x  and  y 
such  that 

1.  i  S  tb{x),  or 

2.  for  some  concept  name  A,  {A,^A}  C  L(x),  or 

3.  x  is  an  S-neighbour  of  x  and  -GS.Self  £  L(x),  or 

4-  x  and  y  are  root  nodes,  y  is  an  R-neighbour  of  x,  and  -*R  £  L((x,y}),  or 

5.  there  is  some  Dis (R,S)  £  7 Za  and  y  is  an  R-  and  an  S-neighbour  of  x,  or 

6.  there  is  some  concept  nS.C )  £  H(x)  and  {yo, . . . ,  yn}  C  SF(x,C )  with 
Vi  fk  yj  for  all  0  <  i  <  j  <  n. 

A  completion  forest  that  does  not  contain  a  clash  is  called  clash-free.  A  com¬ 
pletion  forest  is  complete  if  none  of  the  rules  from  Figure  1  can  be  applied  to 
it. 

When  started  with  a  non-empty  Abox  A  and  a  reduced  Rbox  TZ,  the  tab¬ 
leau  algorithm  initialises  F a,tz  and  repeatedly  applies  the  expansion  rides  from 
Figure  1  to  it,  stopping  when  a  clash  occurs,  and  applying  the  shrinking  rides 
eagerly,  i.e.,  the  <-  and  the  <r-rule  are  applied  with  highest  priority.  The  algo¬ 
rithm  answers  “A  is  satisfiable  w.r.t.  TZ”  iff  the  expansion  rules  can  be  applied 
in  such  a  way  that,  they  yield  a  complete  and  clash-free  completion  forest,  and 
“A  is  unsatisfiable  w.r.t.  TZ”  otherwise. 

Lemma  1.  Let  A  be  a  SIZIQ-Abox  where  all  concepts  are  in  negation  normal 
form  and  TZ  a  reduced  Rbox. 

—  The  tableau  algorithm  terminates  when  started  for  A  and  TZ. 

—  The  expansion  rules  can  be  applied  to  A  and  TZ  such  that,  they  yield  a  complete 
and  clash-free  completion  forest  iff  there  is  a  tableau  for  A  w.r.t..  TZ. 

From  Theorem  1  and  Lemma  1,  we  thus  have  the  following  theorem: 

Theorem  2.  The  tableau  algorithm  decides  satisfiability  and  subsumption  of 
STZ1Q- concepts  with  respect,  to  Aboxes,  Rboxes,  and  Tboxes. 
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n-rule:  if  C\  n  C2  £  £(*),  x  is  not  indirectly  blocked,  and  {Ci,  C2}  2  £(*), 
then  £(*)  — >  £(*)  U  {Ci,  C2} 

U-rule:  if  C\  U  C2  £  E(x),  x  is  not  indirectly  blocked,  and 

{Ci,  C2}  n  £(*)  =  0 

then  £(*)  — >  £(®)  U  {E}  for  some  E  £  {Ci,  C2} 

3-rule:  if  BS.C  £  £(®),  x  is  not  blocked,  and 

x  has  no  S-neighbour  y  with  C  £  £(1/) 
then  create  a  new  node  y  with 

_ £((g,2/))  :=  {-S'}  and  £(y)  :=  {C} _ 

Self-rule:  if  BS.Self  £  £(*),  x  is  not  blocked,  and  S  ^  £((*,*}) 
then  add  an  edge  (x,  x)  if  it  does  not  yet  exist,  and 
set  £((*,  x))  — >  £((*,  x))  U  {S'} 

Vi-rule:  ifVS.C  G  £(x),  x  is  not  indirectly  blocked,  and 
VZ3S.C  <£L(x) 

then  £(*)  — >  £(*)  U  {V£>s.C} 

V2-rule:  if  VB(p).C  G  £(*),  x  is  not  indirectly  blocked,  p  — >  q  in  B(p), 
and  there  is  an  S-neighbour  y  of  x  with  \/B(q).C  ^  £(1/), 

_ then  L(y)  — >  £(y)  U  {\/B(q).C} _ 

V3-rule:  if  \/B.C  G  £(*),  x  is  not  indirectly  blocked,  e  G  L(B),  and  C  2  £(*) 
then  L(x)  — >  £(*)  U  {C} 

choose-rule:  if  nS.C )  G  £(*),  x  is  not  indirectly  blocked,  and 

there  is  an  S-neighbour  y  of  x  with  {C,  GC}  fl  £(t/)  =  0 
then  £(j/)  — >  £(3/)  U  {E}  for  some  E  G  {C,  GC} 

^-rule:  if  nS.C )  G  £(x),  x  is  not  blocked,  and 

there  are  no  yi , . . . ,  yn  G  SF  (x,  C) 
with  yi  2  Vj  for  each  1  <  i  <  j  <  n 
then  create  n  new  successors  3/1, . . . ,  yn  of  a:  with  £((®,  yi))  =  {S}, 

_ £(3/1)  =  {C},  and  yt  2  Vj  for  1  <  i  <  j  <  n. _ 

^-rule:  if  nS.C )  G  £(*),  x  is  not  indirectly  blocked,  and 

#SF  (x,  C)  >  n,  there  are  y,  2  G  SF(x,  C)  with 
not  y  2  2  and  y  is  not  a  root  node  nor  an  ancestor  of  2, 
then  1.  £(2)  — >  £(2)  U  £(1/)  and 

2.  if  2  is  an  ancestor  of  x 

then  £((2,*))  — ♦  L((z,x))  U  lnv(£((*,  3/))) 
else  £j((x,z)) — >  £((x,  2))  U  £((*,  1/}) 

3.  Set  u  2  z  for  all  u  with  U  2  ?/• 

4.  remove  y  and  the  sub-tree  below  y  from  F. 

<r-mlc:  if  {^nS.C)  G  £(x),  #Sb'{x,C)  >  n, 

and  there  are  two  root  nodes  y,  z  G  SF(®,  C)  with  not  y  2  2, 
then  1.  £(2)  — ♦  £(2)  U  £(3/)  and 

2.  For  all  edges  (y,w): 

i.  if  the  edge  (z,w)  does  not  exist,  create  it  with  £((2,111})  :=  £((3/,  in)); 

ii.  else  £((2, 10))  — >  £((2,  in})  U  £((3/,  in}). 

3.  For  all  edges  ( w,y ): 

i.  if  the  edge  (in,  2}  does  not  exist,  create  it  with  £((in,  2})  :=  £((in,  3/}); 

ii.  else  £((in,  2))  — >  £((in,  2))  U  £((in,  y)). 

4.  Set  u  2  z  for  all  u  with  n  2  ?/• 

5.  Remove  y  and  all  incoming  and  outgoing  edges  from  y  from  F. 


Fig.  1.  The  Expansion  Rules  for  the  S1ZTQ  Tableau  Algorithm. 
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Abstract.  Description  Logics  (DLs)  are  a  family  of  logic  based  knowl¬ 
edge  representation  formalisms.  Although  they  have  a  range  of  appli¬ 
cations  (e.g.,  configuration  and  information  integration),  they  are  per¬ 
haps  best  known  as  the  basis  for  widely  used  ontology  languages  such 
as  OWL  (now  a  W3C  recommendation).  This  decision  was  motivated 
by  a  requirement  that  key  inference  problems  be  decidable,  and  that  it 
should  be  possible  to  provide  reasoning  services  to  support  ontology  de¬ 
sign  and  deployment.  Such  reasoning  services  are  typically  provided  by 
highly  optimised  implementations  of  tableaux  decision  procedures;  these 
have  proved  to  be  effective  in  applications  in  spite  of  the  high  worst  case 
complexity  of  key  inference  problems.  The  increasing  use  of  DL  based 
ontologies  in  areas  such  as  e-Science  and  the  Semantic  Web  is,  however, 
already  stretching  the  capabilities  of  existing  DL  systems,  and  brings 
with  it  a  range  of  research  challenges. 


1  Introduction 

Description  Logics  (DLs)  are  a  family  of  class  (concept)  based  knowledge  repre¬ 
sentation  formalisms.  They  are  characterised  by  the  use  of  various  constructors 
to  build  complex  concepts  from  simpler  ones,  an  emphasis  on  the  decidability  of 
key  reasoning  tasks,  and  by  the  provision  of  sound,  complete  and  (empirically) 
tractable  reasoning  services. 

Although  they  have  a  range  of  applications  (e.g.,  reasoning  with  database 
schemas  and  queries  [1-3]),  DLs  are  perhaps  best  known  as  the  basis  for  on¬ 
tology  languages  such  as  OIL,  DAML+OIL  and  OWL  [4].  The  decision  to  base 
these  languages  on  DLs  was  motivated  by  a  requirement  not  only  that  key  infer¬ 
ence  problems  (such  as  class  satisfiability  and  subsumption)  be  decidable,  but 
that  “practical”  decision  procedures  and  “efficient”  implemented  systems  also 
be  available. 

That  DLs  were  able  to  meet  the  above  requirements  was  the  result  of  exten¬ 
sive  research  within  the  DL  community  over  the  course  of  the  preceding  20  years 
or  more.  This  research  mapped  out  a  complex  landscape  of  languages,  exploring 
a  range  of  different  language  constructors,  studying  the  effects  of  various  com¬ 
binations  of  these  constructors  on  decidability  and  worst  case  complexity,  and 
devising  decision  procedures,  the  latter  often  being  tableaux  based  algorithms. 
At  the  same  time,  work  on  implementation  and  optimisation  techniques  demon¬ 
strated  that,  in  spite  of  the  high  worst  case  complexity  of  key  inference  problems 


(usually  at  least  ExpTime),  highly  optimised  DL  systems  were  capable  of  pro¬ 
viding  practical  reasoning  support  in  the  typical  cases  encountered  in  realistic 
applications. 

With  the  added  impetus  provided  by  the  OWL  standardisation  effort,  DL 
systems  are  now  being  used  to  provide  computational  services  for  a  rapidly 
expanding  range  of  ontology  tools  and  applications  [5-10].  The  increasing  use 
of  DL  based  ontologies  in  areas  such  as  e-Science  and  the  Semantic  Web  is, 
however,  already  stretching  the  capabilities  of  existing  DL  systems,  and  brings 
with  it  a  range  of  research  challenges. 

2  Ontologies  and  Ontology  Reasoning 

In  Computer  Science,  an  ontology  is  usually  taken  to  mean  a  conceptual  model 
(of  some  domain),  typically  realised  as  a  hierarchical  vocabulary  of  terms,  to¬ 
gether  with  formal  specifications  of  the  meaning  of  each  term.  These  specifica¬ 
tions  are  often  given  with  reference  to  other  (simpler)  terms  in  the  ontology.  For 
example,  in  a  medical  terminology  ontology,  the  meaning  of  the  term  Gastritis 
might  be  specified  as  an  InfiammatoryProcess  whose  outcome  is  InflammationOf- 
Stomach,  where  InfiammatoryProcess ,  outcome  and  InflammationOfStomach  are 
all  terms  from  the  ontology.  Such  vocabularies  may  be  used,  e.g.,  to  facilitate 
data  sharing  and  reuse  (often  by  annotating  data  using  terms  from  a  shared 
ontology),  to  structure  data,  or  simply  to  explicate  and  investigate  knowledge  of 
a  domain. 

Ontologies  play  a  major  role  in  the  Semantic  Web  (where  they  are  used  to 
annotate  web  resources)  [11,12],  and  are  widely  used  in,  e.g.,  knowledge  manage¬ 
ment  systems,  e-Science,  and  bio-informatics  and  medical  terminologies  [13-16]. 
They  are  also  of  increasing  importance  in  the  Grid,  where  they  may  be  used, 
e.g.,  to  support  the  discovery,  execution  and  monitoring  of  Grid  services  [17-19]. 

Given  the  formal  and  compositional  nature  of  ontologies,  it  is  natural  to  use 
logics  as  the  basis  for  ontology  languages — this  allows  for  the  precise  definition 
of  the  meaning  of  compositional  operators  (such  as  “and”  and  “or”),  and  of  re¬ 
lationships  between  terms  (such  as  “subclass”  and  “instance”).  The  effective  use 
of  logic  based  ontology  languages  in  applications  will,  however,  critically  depend 
on  the  provision  of  efficient  reasoning  support.  On  the  one  hand,  such  support 
is  required  by  ontology  engineers  in  order  to  help  them  to  design  and  maintain 
sound,  well-balanced  ontologies  [20] .  On  the  other  hand,  such  support  is  required 
by  applications  in  order  to  exploit  the  formal  specification  of  meaning  captured 
in  ontologies:  querying  ontologies  and  ontology  structured  data,  is  equivalent  to 
computing  logical  entailments  [21]. 

3  Ontology  Languages  and  Description  Logics 

The  OWL  recommendation  actually  consists  of  three  languages  of  increasing 
expressive  power:  OWL  Lite,  OWL  DL  and  OWL  Full.  Like  OWL’s  predecessor 
DAML+OIL,  OWL  Lite  and  OWL  DL  are  basically  very  expressive  description 


logics  with  an  RDF  syntax.  OWL  Full  provides  a  more  complete  integration 
with  RDF,  but  its  formal  properties  are  less  well  understood,  and  key  inference 
problems  would  certainly  be  much  harder  to  compute.1  For  these  reasons,  OWL 
Full  will  not  be  considered  here. 

More  precisely,  OWL  DL  is  based  on  the  STLOXQ  DL  [23];  it  restricts  the 
form  of  number  restrictions  to  be  unqualified  (see  [24]),  and  adds  a  simple  form 
of  Datatypes  (often  called  concrete  domains  in  DLs  [25]).  Following  the  usual  DL 
naming  conventions,  the  resulting  logic  is  called  STLOXMfD) ,  with  the  different 
letters  in  the  name  standing  for  (sets  of)  constructors  available  in  the  language:  S 
stands  for  the  basic  ACC  DL  (equivalent  to  the  propositional  modal  logic  K(mj) 
extended  with  transitive  roles  [22],  TL  stands  for  role  hierarchies  (equivalently, 
inclusion  axioms  between  roles),  O  stands  for  nominals  (classes  whose  extension 
is  a  single  individual)  [26],  M  stands  for  unqualified  number  restrictions  and 
(D)  stands  for  datatypes)  [27].  OWL  Lite  is  equivalent  to  the  slightly  simpler 
STLXLFfD)  DL  (i.e.,  STLOXQ  without  nominals,  and  with  only  functional  number 
restrictions). 

These  equivalences  allow  OWL  to  exploit  the  considerable  existing  body  of 
description  logic  research,  e.g.: 

—  to  define  the  semantics  of  the  language  and  to  understand  its  formal  prop¬ 
erties,  in  particular  the  decidability  and  complexity  of  key  inference  prob¬ 
lems  [28]; 

—  as  a  source  of  sound  and  complete  algorithms  and  optimised  implementation 
techniques  for  deciding  key  inference  problems  [29,  22,  27] ; 

—  to  use  implemented  DL  systems  in  order  to  provide  (partial)  reasoning  sup¬ 
port  [30-32]. 


3.1  STLOXM  Syntax  and  Semantics 

The  syntax  and  semantics  of  STLOXM  are  briefly  introduced  here  (we  will  ignore 
datatypes,  as  adding  a  datatype  component  would  complicate  the  presentation 
and  has  little  affect  on  reasoning  [33]). 

Definition  1.  Let  R  be  a  set  of  role  names  with  both  transitive  and  normal  role 
names  R+  U  Rp  =  R,  where  Rp  n  R+  =  0.  The  set  of  STLOXM  -roles  (or  roles 
for  short)  is  R  U  {R~  \  R  €  R}.  A  role  inclusion  axiom  is  of  the  form  R  C  S, 
for  two  roles  R  and  S.  A  role  hierarchy  is  a  finite  set  of  role  inclusion  axioms. 

An  interpretation  X  =  (A1 ,  -T)  consists  of  a  non-empty  set  A1 ,  called  the 
domain  of  X,  and  a  function  -1  which  maps  every  role  to  a  subset  of  A1  x  A1 
such  that,  for  P  6  R  and  R  €  R+ , 

(x,y)  e  P 1  iff  (y,  x)  €  P~T , 
and  if  (x,y)  6  Rx  and  ( y,z )  6  Rx ,  then  (x,z)  6  R1 . 

1  Inference  in  OWL  Full  is  clearly  undecidable  as  OWL  Full  does  not  include  restric¬ 
tions  on  the  use  of  transitive  properties  which  are  required  in  order  to  maintain 
decidability  [22], 


An  interpretation  T  satisfies  a  role  hierarchy  TZ  iff  R x  C  S1  for  each  ROSE  71; 
such  an  interpretation  is  called  a  model  of  TZ. 

Definition  2.  Let  Nc  be  a  set  of  concept  names  with  a  subset  Nj  C  Nc  of 
nominals.  The  set  of  STL OTAf  -concepts  (or  concepts  for  short)  is  the  smallest 
set  such  that 

1.  every  concept  name  C  E  Nc  is  a  concept, 

2.  if  C  and  D  are  concepts  and  R  is  a  role,  then  ( C  n  D),  (C  U  D), 
(WR.C),  and  (3 R.C)  are  also  concepts  (the  last  two  are  called  universal  and 
existential  restrictions,  resp.),  and 

3.  if  R  is  a  simple  role2  and  n  E  N,  then  ^ nR  and  ~^nR.  are  also  concepts 
(called  almost  and  atleast  number  restrictions). 

The  interpretation  function  -x  of  an  interpretation  T  =  (Ax ,  •;r)  maps,  addition¬ 
ally,  every  concept  to  a  subset  of  A1  such  that 

(C  n  D)1  =  Cx  n  D1,  (C  U  D)1  =  Cx  U  D1,  -~CX  =  AX\CX, 
j \ox  =  1  for  all  o  E  Nj , 

(3 R.C)1  =  {x  E  A1  |  There  is  a  y  E  Ax  with  {x,y}  E  Rx  and  y  E  Cx}, 

(WR.C)1  =  {x  6  Ax  |  For  all  y  E  A1 ,  if  (x,y)  E  Rx ,  then  y  E  Cx}, 

C nR1  =  {x  E  Ax  \  jj {y  I  (x,  y)  E  R1}  ^  n}, 

^ nR1  =  [xE  Ax  \  ft {y  \  (x,  y)  E  R1}  ^  n}, 

where,  for  a  set  M,  we  denote  the  cardinality  of  M  by  JJM. 

For  C  and  D  (possibly  complex)  concepts,  C  Q  D  is  called  a  general  concept 
inclusion  (GCI),  and  a  finite  set  of  GCIs  is  called  a  TBox. 

An  interpretation  T  satisfies  a  GCI  C  t  D  if  Cx  C  Dx ,  and  T  satisfies  a 
TBox  T  ifT  satisfies  each  GCI  in  T ;  such  an  interpretation  is  called  a  model 
of  T. 

A  concept  C  is  called  satisfiable  with  respect  to  a  role  hierarchy  TZ  and  a 
TBox  T  if  there  is  a  model  T  of  TZ  and  T  with  Cx  ^  0.  Such  an  interpretation  is 
called  a  model  of  C  w.r.t.  7Z  and  T .  A  concept  D  subsumes  a  concept  C  w.r.t. 
TZ  and  T  (written  C  Etc.t  D)  if  Cx  C  Dx  holds  in  every  model  I  of  7Z  and  T. 
Two  concepts  C,D  are  equivalent  w.r.t.  TZ  and  T  (written  C  =u,t  D)  iff  they 
are  mutually  subsuming  w.r.t.  TZ  and  T .  (When  TZ  and  T  are  obvious  from  the 
context,  we  will  often  write  C  C  D  and  C  =  D.)  For  an  interpretation  T,  an 
individual  x  E  A1  is  called  an  instance  of  a  concept  C  iff  x  E  Cx . 

Note  that,  as  usual,  subsumption  and  satisfiability  can  be  reduced  to  each 
other,  and  reasoning  w.r.t.  general  TBoxes  and  role  hierarchies  can  be  reduced 
to  reasoning  w.r.t.  role  hierarchies  only  [22,27]. 

2  A  role  is  simple  if  it  is  neither  transitive  nor  has  any  transitive  subroles.  Restricting 
number  restrictions  to  simple  roles  is  required  in  order  to  yield  a  decidable  logic  [22]. 


3.2  Practical  Reasoning  Services 

Most  modern  DL  systems  use  tableaux  algorithms  to  test  concept  satisfiability. 
These  algorithms  work  by  trying  to  construct  (a  tree  representation  of)  a  model 
of  the  concept,  starting  from  an  individual  instance.  Tableaux  expansion  rules 
decompose  concept  expressions,  add  new  individuals  (e.g.,  as  required  by  3R.C 
terms),3  and  merge  existing  individuals  (e.g.,  as  required  by  ^ nR.C  terms).  Non¬ 
determinism  (e.g.,  resulting  from  the  expansion  of  disjunctions)  is  dealt  with  by 
searching  the  various  possible  models.  For  an  unsatisfiable  concept,  all  possible 
expansions  will  lead  to  the  discovery  of  an  obvious  contradiction  known  as  a 
clash  (e.g.,  an  individual  that  must  be  an  instance  of  both  A  and  —>A  for  some 
concept  A);  for  a  satisfiable  concept,  a  complete  and  clash-free  model  will  be 
constructed  [34]. 

Tableaux  algorithms  have  many  advantages.  It  is  relatively  easy  to  design 
provably  sound,  complete  and  terminating  algorithms,  and  the  basic  technique 
can  be  extended  to  deal  with  a  wide  range  of  class  and  role  constructors.  More¬ 
over,  although  many  algorithms  have  a  higher  worst  case  complexity  than  that  of 
the  underlying  problem,  they  are  usually  quite  efficient  at  solving  the  relatively 
easy  problems  that  are  typical  of  realistic  applications. 

Even  in  realistic  applications,  however,  problems  can  occur  that  are  much 
too  hard  to  be  solved  by  naive  implementations  of  theoretical  algorithms.  Mod¬ 
ern  DL  systems,  therefore,  include  a  wide  range  of  optimisation  techniques,  the 
use  of  which  has  been  shown  to  improve  typical  case  performance  by  several  or¬ 
ders  of  magnitude  [29,35,36,32,37,38].  Key  techniques  include  lazy  unfolding, 
absorption  and  dependency  directed  backtracking. 

Lazy  Unfolding  In  an  ontology,  or  DL  Tbox,  large  and  complex  concepts  are 
seldom  described  monolithically,  but  are  built  up  from  a  hierarchy  of  named 
concepts  whose  descriptions  are  less  complex.  The  tableaux  algorithm  can  take 
advantage  of  this  structure  by  trying  to  find  contradictions  between  concept 
names  before  adding  expressions  derived  from  Tbox  axioms.  This  strategy  is 
known  as  lazy  unfolding  [29,36]. 

The  benefits  of  lazy  unfolding  can  be  maximised  by  lexically  normalising  and 
naming  all  concept  expressions  and,  recursively,  their  sub-expressions.  An  ex¬ 
pression  C  is  normalised  by  rewriting  it  in  a  standard  form  (e.g.,  disjunctions  are 
rewritten  as  negated  conjunctions);  it  is  named  by  substituting  it  with  a  new  con¬ 
cept  name  A,  and  adding  an  axiom  A  =  C  to  the  Tbox.  The  normalisation  step 
allows  lexically  equivalent  expressions  to  be  recognised  and  identically  named, 
and  can  even  detect  syntactically  “obvious”  satisfiability  and  unsatisfiability. 

Absorption  Not  all  axioms  are  amenable  to  lazy  unfolding.  In  particular,  so 
called  general  concept  inclusions  (GCIs),  axioms  of  the  form  C  C  D  where  C 
is  non-atomic,  must  be  dealt  with  by  explicitly  making  every  individual  in  the 

3  Cycle  detection  techniques  known  as  blocking  may  be  required  in  order  to  guarantee 
termination. 


model  an  instance  of  D\J->C.  Large  numbers  of  GCIs  result  in  a  very  high  degree 
of  non-determinism  and  catastrophic  performance  degradation  [36]. 

Absorption  is  another  rewriting  technique  that  tries  to  reduce  the  number 
of  GCIs  in  the  Tbox  by  absorbing  them  into  axioms  of  the  form  ACC,  where 
A  is  a  concept  name.  The  basic  idea  is  that  an  axiom  of  the  form  An  D  Q  D' 
can  be  rewritten  as  A  C  D'  U  ->  D  and  absorbed  into  an  existing  ACC  axiom 
to  give  ACC  n  ( D '  U  ->  D)  [39].  Although  the  disjunction  is  still  present,  lazy 
unfolding  ensures  that  it  is  only  applied  to  individuals  that  are  already  known 
to  be  instances  of  A. 

Dependency  Directed  Backtracking  Inherent  unsatishability  concealed  in 
sub-expressions  can  lead  to  large  amounts  of  unproductive  backtracking  search 
known  as  thrashing.  For  example,  expanding  the  expression  (CiU-Di)n. .  ,n(C„U 
Dn)  n  31?. (A  nB)n  VR.-iA  could  lead  to  the  fruitless  exploration  of  2"  possible 
expansions  of  (C\  U  Di)  Cl  ...  13  ( Cn  U  Dn)  before  the  inherent  unsatishability 
of  3R.  (A  nB)n  \/R.->A  is  discovered.  This  problem  is  addressed  by  adapting  a 
form  of  dependency  directed  backtracking  called  backjumping,  which  has  been 
used  in  solving  constraint  satisfiability  problems  [40]. 

Backjumping  works  by  labelling  concepts  with  a  dependency  set  indicating 
the  non-deterministic  expansion  choices  on  which  they  depend.  When  a  clash  is 
discovered,  the  dependency  sets  of  the  clashing  concepts  can  be  used  to  identify 
the  most  recent  non-deterministic  expansion  where  an  alternative  choice  might 
alleviate  the  cause  of  the  clash.  The  algorithm  can  then  jump  back  over  inter¬ 
vening  non-deterministic  expansions  without  exploring  any  alternative  choices. 
Similar  techniques  have  been  used  in  hrst  order  theorem  provers,  e.g.,  the  “proof 
condensation”  technique  employed  in  the  HARP  theorem  prover  [41]. 

4  Research  Challenges  for  Ontology  Reasoning 

The  development  of  the  OWL  language,  and  the  successful  use  of  reasoning 
systems  in  tools  such  as  the  Protege  editor  [42],  has  demonstrated  the  utility  of 
logic  and  automated  reasoning  in  the  ontology  domain.  The  increasing  use  of  DL 
based  ontologies  in  areas  such  as  e-Science  and  the  Semantic  Web  is,  however, 
already  stretching  the  capabilities  of  existing  DL  systems,  and  brings  with  it  a 
range  of  challenges  for  future  research. 

Scalability  Practical  ontologies  may  be  very  large — tens  or  even  hundreds  of 
thousands  of  classes.  Dealing  with  large-scale  ontologies  already  presents  a  chal¬ 
lenge  to  the  current  generation  of  DL  reasoners,  in  spite  of  the  fact  that  many 
existing  large-scale  ontologies  are  relatively  simple.  In  the  40,000  concept  Gene 
Ontology  (GO),  for  example,  much  of  the  semantics  is  currently  encoded  in  class 
names  such  as  “heparin- metabolism” ;  enriching  GO  with  more  complex  defini- 
tions,  e.g.,  by  explicitly  modelling  the  fact  that  heparin-metabolism  is  a  kind  of 
“metabolism”  that  “acts-on”  the  carbohydrate  “heparin”,  would  make  the  se¬ 
mantics  more  accessible,  and  would  greatly  increase  the  value  of  GO  by  enabling 


new  kinds  of  query  such  as  “what  biological  processes  act  on  glycosaminoglycan” 
(heparin  is  a  kind  of  glycosaminoglycan)  [43].  However,  adding  more  complex 
class  definitions  can  cause  the  performance  of  existing  reasoners  to  degrade  to 
the  point  where  it  is  no  longer  acceptable  to  users.  Similar  problems  have  been 
encountered  with  large  medical  terminology  ontologies,  such  as  the  GALEN  on¬ 
tology  [44], 

Moreover,  as  well  as  using  a  conceptual  model  of  the  domain,  many  appli¬ 
cations  will  also  need  to  deal  with  very  large  volumes  of  instance  data — the 
Gene  Ontology,  for  example,  is  used  to  annotate  millions  of  individuals,  and 
practitioners  want  to  answer  queries  that  refer  both  to  the  ontology  and  to  the 
relationships  between  these  individuals,  e.g.,  “what  DNA  binding  products  inter¬ 
act  with  insulin  receptors” .  Answering  this  query  requires  a  reasoner  not  only  to 
identify  individuals  that  are  (perhaps  only  implicitly)  instances  of  DNA  binding 
products  and  of  insulin  receptors,  but  also  to  identify  which  pairs  of  individuals 
are  (perhaps  only  implicitly)  instances  of  the  interactsWith  role.  For  existing 
ontology  languages  it  is  possible  to  use  DL  reasoning  to  answer  such  queries, 
but  dealing  with  the  large  volume  of  GO  annotated  gene  product  data  is  far 
beyond  the  capabilities  of  existing  DL  systems  [45] . 

Several  different  approaches  to  this  problem  are  already  under  investigation. 
One  of  these  involves  the  use  of  a  hybrid  DL-DB  architecture  in  which  instance 
data  is  stored  in  a  database,  and  query  answering  exploits  the  relatively  simple 
relational  structure  encountered  in  typical  data  sets  in  order  minimise  the  use 
of  DL  reasoning  and  maximise  the  use  of  database  operations  [46].  Another 
technique  that  is  under  investigation  is  to  use  reasoning  techniques  based  on 
the  encoding  of  STLXQ  ontologies  in  Datalog  [47].  On  the  one  hand,  theoretical 
investigations  of  this  technique  have  revealed  that  data  complexity  (i.e.,  the 
complexity  of  answering  queries  against  a  fixed  ontology  and  set  of  instance 
data)  is  significantly  lower  than  the  complexity  of  class  consistency  reasoning 
(i.e.,  NP-complete  for  STLXQ,  and  even  polynomial-time  for  a  slight  restriction 
of  STLXQ )  [48];  on  the  other  hand,  the  technique  would  allow  relatively  efficient 
Datalog  engines  to  be  used  to  store  and  reason  with  large  volumes  of  instance 
data. 


Expressive  Power  OWL  is  a  relatively  rich  ontology  language,  but  many  ap¬ 
plications  require  even  greater  expressive  power  than  that  which  is  provided 
by  the  existing  OWL  standard.  For  example,  in  ontologies  describing  complex 
physically  structured  domains  such  as  biology  [43]  and  medicine  [44],  it  is  often 
important  to  describe  aggregation  relationships  between  structures  and  their 
component  parts,  and  to  assert  that  certain  properties  of  the  component  parts 
transfer  to  the  structure  as  a  whole  (a  femur  with  a  fractured  shaft  is  a  frac¬ 
tured  femur)  [49] .  The  importance  of  this  kind  of  knowledge  can  be  gauged  from 
the  fact  that  various  “work-arounds”  have  been  described  for  use  with  ontology 
languages  that  cannot  express  it  directly  [50]. 

It  may  not  be  possible  to  satisfy  all  expressive  requirements  while  staying 
within  a  decidable  fragment  of  first  order  logic.  Recent  research  has,  therefore, 


studied  the  use  in  ontology  reasoning  of  semi-decision  procedures  such  as  reso¬ 
lution  based  theorem  provers  for  full  first  order  logic  [51].  There  have  also  been 
studies  of  languages  that  combine  a  DL  with  some  other  logical  formalism,  of¬ 
ten  Datalog  style  rules,  with  the  connection  between  the  two  formalisms  being 
restricted  so  as  to  maintain  decidability  [52, 47,  53] 


Extended  Reasoning  Services  Finally,  in  addition  to  solving  problems  of 
class  consistency/subsumption  and  instance  checking,  explaining  how  such  in¬ 
ferences  are  derived  may  be  important,  e.g.,  to  help  an  ontology  designer  to 
rectify  problems  identified  by  reasoning  support,  or  to  explain  to  a  user  why  an 
application  behaved  in  an  unexpected  manner. 

Work  on  developing  practical  explanation  systems  is  at  a  relatively  early 
stage,  with  different  approaches  still  being  developed  and  evaluated.  One  such 
technique  involves  exploiting  standard  reasoning  services  to  identify  a  small  set  of 
axioms  that  still  support  the  inference  in  question,  the  hope  being  that  presenting 
a  much  smaller  (than  the  complete  ontology)  set  of  axioms  to  the  user  will 
help  them  to  understand  the  “cause”  of  the  inference  [54].  Another  (possibly 
complementary)  technique  involves  explaining  the  steps  by  which  the  inference 
was  derived,  e.g.,  using  a  sequence  of  simple  natural  deduction  style  inferences 
[55,56]. 

As  well  as  explanation,  so-called  “non-standard  inferences”  could  also  be 
important  in  supporting  ontology  design;  these  include  matching,  approximation, 
and  difference  computations.  Non-standard  inferences  are  the  subject  of  ongoing 
research  [57-60];  it  is  still  not  clear  if  they  can  be  extended  to  deal  with  logics 
as  expressive  as  those  that  underpin  modern  ontology  languages,  or  if  they  will 
scale  to  large  applications  ontologies. 

5  Summary 

Description  Logics  are  a  family  of  class  based  knowledge  representation  for¬ 
malisms  characterised  by  the  use  of  various  constructors  to  build  complex  classes 
from  simpler  ones,  and  by  an  emphasis  on  the  provision  of  sound,  complete  and 
(empirically)  tractable  reasoning  services.  They  have  been  used  in  a  wide  range 
of  applications,  but  perhaps  most  notably  (at  least  in  recent  times)  in  providing 
a  formal  basis  and  reasoning  services  for  (web)  ontology  languages  such  as  OWL. 

The  effective  use  of  logic  based  ontology  languages  in  applications  will,  how¬ 
ever,  critically  depend  on  the  provision  of  efficient  reasoning  services  to  support 
both  ontology  design  and  deployment.  The  increasing  use  of  DL  based  ontologies 
in  areas  such  as  e-Science  and  the  Semantic  Web  is,  however,  already  stretching 
the  capabilities  of  existing  DL  systems,  and  brings  with  it  a  range  of  challenges 
for  future  research.  The  extended  ontology  languages  needed  in  some  applications 
may  demand  the  use  of  more  expressive  DLs,  and  even  for  existing  languages, 
providing  efficient  reasoning  services  is  extremely  challenging. 

Some  applications  may  even  call  for  ontology  languages  based  on  larger  frag¬ 
ments  of  FOL.  The  development  of  such  languages,  and  reasoning  services  to 


support  them,  extends  these  challenges  to  the  whole  logic  based  Knowledge 
Representation  community. 
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Abstract 

OWL  1.1  extends  OWL  DL  in  several  ways,  extending  the  Description  Logic  underlying  OWL  1.1  to  SROIQ,  adding 
user-defined  datatypes  and  restrictions  involving  datatype  predicates,  adding  a  punning  version  of  meta-modelling,  and 
adding  a  semantics-free  comment  mechanism.  This  document  defines  the  function-style  syntax  for  OWL  1.1,  and 
provides  an  informal  discussion  of  the  meaning  of  the  additional  constructs. 

Status  of  this  Document 

This  is  a  editor's  draft,  for  comment  by  the  OWL  community. 

When  approved,  this  document  will  serve  as  a  target  for  implementation  by  the  major  developers  of  OWL  systems. 

This  document  has  been  produced  for  approval  of  the  ad-hoc  OWL  community.  Comments,  both  from  within  and  without 
the  community,  are  welcome.  Although  the  development  of  OWL  1.1  is  not  a  W3C  activity,  public  comments  should  be 
sent  to  the  semantic-web@w3c.org  (archive)  to  ensure  widest  visibility. 
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1.  OWL  DL  Syntax 

This  is  a  recapitulation  of  the  OWL  DL  syntax,  with  some  changes  that  do  not  affect  the  language,  included  here  as  a 
basis  for  the  OWL  1.1  syntax.  The  OWL  DL  syntax  is  called  an  abstract  syntax,  which  is  not  quite  right,  so  the  syntax  is 
here  called  the  function-style  syntax  where  necessary. 

The  syntax  is  specified  here  by  means  of  a  version  of  Extended  BNF,  very  similar  to  the  EBNF  notation  used  for  XML 
[XML],  Terminals  are  quoted;  non-terminals  are  bold  and  not  quoted.  Alternatives  are  either  separated  by  vertical  bars 
(|)  or  are  given  in  different  productions.  Components  that  can  occur  at  most  once  are  enclosed  in  square  brackets  ([...]); 
components  that  can  occur  any  number  of  times  (including  zero)  are  enclosed  in  braces  ({...}).  Whitespace  is  ignored  in 
the  productions. 

1.0.  Preliminaries 

Names  in  the  syntax  are  RDF  URI  references,  \RDF  Concepts 1.  These  names  can  be  abbreviated  into  qualified  names. 
The  following  prefixes  are  ottern  used: 


Namespace  name 


Namespace 
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rdf 

http://www.w3.Org/1999/02/22-rdf-syntax-ns# 

rdfs 

http://www.w3.Org/2000/01/rdf-schema# 

xsd 

http://www.w3.Org/2001/XMLSchema# 

owl 

http://www.w3.Org/2002/07/owl# 

Names  are  divided  up  into  several  syntactic  categories: 

datatypelD  : :=  URIreference 
classID  : :=  URIreference 
individuallD  ::=  URIreference 
ontologylD  ::=  URIreference 
datavaluedPropertylD  : :=  URIreference 
individualvaluedPropertylD  : :=  URIreference 
annotationPropertylD  : :=  URIreference 
ontologyPropertylD  : :=  URIreference 

A  name  cannot  be  both  a  datatypelD  and  a  classID  in  an  ontology.  A  name  also  cannot  be  more  than  one  of  an 
datavaluedPropertylD,  an  individualvaluedPropertylD,  an  annotationPropertylD,  or  an  ontologyPropertylD  in  an  ontology. 

Further,  a  name  cannot  be  more  than  one  of  the  above  categories  in  OWL  DL.  (This  limitation  is  separated  because  it  is 
relaxed  in  OWL  1.1.) 

There  are  two  built-in  classes  in  OWL.  The  class  with  identifier  owLThing  is  the  class  of  all  individuals.  The  class  with 
identifier  owLNothing  is  the  empty  class. 

The  following  XML  Schema  datatypes  fXML  Schema  Datatypes!  can  be  used  in  OWL  as  built-in  datatypes  by  means  of 
the  XML  Schema  canonical  URI  reference  tor  the  datatype:  xsd:strinq,  xsd:boolean,  xsd:decimal,  xsd:float,  xsd:double, 
xsd:dateTime,  xsd:time,  xsd:date,  xsd:qYearMonth,  xscbqYear,  xsd:gMonthDay,  xscbqDay,  xsd:qMonth,  xsd:hexBinary, 
xsd:base64Binarv,  xsd:anvURI,  xsd:normalizedStrmq.  xscbtoken,  xscblanquaqe.  xsd:NMTOKEN,  xsd:Name,  xsd:NCName, 
xsdunteqer,  xsd:nonPositiveTntecier,  xsd:neqativelnteqer,  xsddo'nq,  xscbmt,  xsd:sTior  ,  xsd: byte,  xsdmonFIeqativelnteqer, 
xsd:  unsigned  Long,  xsd:  unsigned  I  nt,  xsd:unsiqnedShort,  xsd:  unsigned  Byte,  and  xsd:positivelnteqer. 

There  are  several  built-in  annotation  properties  in  OWL,  namely  owhversionlnfo,  rdfsdabel,  rdfs:comment,  rdfs:seeAlso, 
and  rdfs:isDefinedBy.  In  keeping  with  their  definition  in  RDF,  rdfsdabel  and  rdfs:comment  can  only  be  used  with  data 
literals. 

There  are  also  several  built-in  ontology  properties:  they  are  owbimports,  owbpriorVersion,  owhbackwardCompatibleWith, 
and  owhincompatibleWith.  Ontology  annotations  that  use  owbimports  have  the  extra  effect  of  importing  the  target 
ontology. 

1.1.  Ontologies  and  Annotations 

ontology  ::=  'Ontology!'  [  ontologylD  ]  {  directive  }  ')' 
directive  : :=  'Annotation! '  ontologyPropertylD  ontologylD  ')' 

'Annotation! 1  annotationPropertylD  URIreference  ')' 

'Annotation!'  annotationPropertylD  dataLiteral  ')' 

'Annotation!'  annotationPropertylD  individual  ')' 

axiom 

fact 

annotation  ::=  'annotation!'  annotationPropertylD  URIreference  ')' 

I  'annotation!'  annotationPropertylD  dataLiteral  ')' 

|  'annotation!'  annotationPropertylD  individual  ')' 


1.2.  Facts 


fact  : :=  individual 

individual  ::=  'Individual!'  [  individuallD  ]  {  annotation  }  {  'type!'  type  ’)'  }  {  value  }  ')' 
value  : :=  'value!'  individualvaluedPropertylD  individuallD  ')' 

I  'value!'  individualvaluedPropertylD  individual  ')' 

|  'value!'  datavaluedPropertylD  dataLiteral  ')' 

type  : :=  description 

dataLiteral  : :=  typedLiteral  |  plainLiteral> 
typedLiteral  :  :=  lexicalForm~''URIreference 
plainLiteral  : :=  lexicalForm  |  lexicalForm@languageTag 
lexicalForm  : :=  as  in  RDF,  a  Unicode  string  in  normal  form  C 
languageTag  : :=  as  in  RDF,  an  XML  language  tag 

fact  : :=  1 Samelndividual ( 1  individuallD  individuallD  {individuallD}  ')' 

|  'Differentlndividuals! '  individuallD  individuallD  {individuallD}  ')' 

1.3.  Axioms 

To  preserve  decidability  of  reasoning  in  OWL  Lite,  not  all  properties  can  have  cardinality  restrictions  placed  on  them  or 
be  specified  as  functional  or  inverse-functional.  An  individual-valued  property  is  complex  if  1/  it  is  specified  as  being 
functional  or  inverse-functional,  2/  there  is  some  cardinality  restriction  that  uses  it,  3/  it  has  an  inverse  that  is  complex, 
or  4/  it  has  a  super-property  that  is  complex.  Individual-valued  properties  that  are  not  complex  are  called  simple.  Only 
simple  properties  can  be  specified  as  being  transitive. 

1.3.1.  Class  Axioms 


axiom  :  :=  'Class!'  classID  ['Deprecated']  modality  {  annotation  }  {  description  }  ')' 
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modality  ::=  'complete'  |  'partial' 

axiom  : :=  ' EnumeratedClass ( '  classID  ['Deprecated']  {  annotation  }  {  individuallD  }  ')' 

axiom  : :=  ' DisjointClasses ( '  description  description  {  description  }  ')' 

|  ' EquivalentClasses ( '  description  {  description  }  ')' 
j  '  SubClassOf ( '  description  description  ')' 

axiom  : :=  'Datatype!'  datatypelD  ['Deprecated']  {  annotation  }  )' 


1.3.2.  Descriptions 

description  : :=  classID 

restriction 

'unionOf!'  {  description  }  ')' 

' intersectionOf ( '  {  description  }  ')' 
' complementOf ( '  description  ')' 
’oneOf('  {  individuallD  }  ')' 


1.3.3.  Restrictions 


restriction  : :=  'restriction!'  datavaluedPropertylD  dataRestrictionComponent  {  dataRestrictionComponent  }  ')' 

|  'restriction!'  individualvaluedPropertylD  individualRestrictionComponent  {  individualRestrictionCompi 
dataRestrictionComponent  ::=  'allValuesFrom! '  dataRange  ')' 

' someValuesFrom! '  dataRange  ')' 

'value!'  dataLiteral  ')' 
dataCardinality 

dataCardinality  ::=  'minCardinality! '  non-negative-integer  ')' 

I  ' maxCardinality! '  non-negative-integer  ')' 

|  'cardinality!'  non-negative-integer  ')' 
individualRestrictionComponent  ::=  'allValuesFrom!'  description  ')' 

'someValuesFrom!'  description  ')' 

'value!'  individuallD  ')' 
individualCardinality 

individualCardinality  : :=  'minCardinality!'  non-negative-integer  ')' 

|  'maxCardinality!'  non-negative-integer  ')' 
j  'cardinality!'  non-negative-integer  ')' 
dataRange  : :=  datatypelD  |  ' rdf s : Literal ' 

|  'oneOf('  {  dataLiteral  }  ')' 


1.3.4.  Property  Axioms 

axiom  : :=  ' DatatypeProperty! '  datavaluedPropertylD  ['Deprecated']  {  annotation  } 

{  'super!'  datavaluedPropertylD  ')'}  ['Functional'] 

{  'domain!'  description  ')'  }  {  'range!'  dataRange  ')'  }  ')' 

|  ' Obj ectProperty ( '  individualvaluedPropertylD  ['Deprecated']  {  annotation  } 

{  'super!'  individualvaluedPropertylD  ')'  } 

[  'inverseOf!'  individualvaluedPropertylD  ')'  ]  [  'Symmetric'  ] 

{  individualvaluedPropertyFlags  } 

{  'domain!'  description  ')'  }  {  'range!'  description  ')'  }  ')' 

I  ' AnnotationProperty ( '  annotationPropertylD  {  annotation  }  ')' 

|  ' OntologyProperty ( '  ontologyPropertylD  {  annotation  }  ')' 

individualvaluedPropertyFlags  : :=  'Functional'  |  ' InverseFunctional '  |  'Transitive' 

axiom  : :=  ' EquivalentProperties ( '  datavaluedPropertylD  datavaluedPropertylD  {  datavaluedPropertylD  }  ')' 

I  ' SubPropertyOf ( '  datavaluedPropertylD  datavaluedPropertylD  ')' 

|  'EquivalentProperties!'  individualvaluedPropertylD  individualvaluedPropertylD 

{  individualvaluedPropertylD  }  ' ) ' 

|  'SubPropertyOf!'  individualvaluedPropertylD  individualvaluedPropertylD  ')' 

2.  OWL  1.1  Syntax  Additions 

2.1.  Syntactic  Sugar 

OWL  1.1  extendes  the  syntax  of  OWL  in  two  ways  that  simply  provide  shorthand  notations  (syntactic  sugar)  for 
expressive  power  otherwise  in  OWL  DL. 

axiom  : :=  ' DisjointUnion! '  description  description  {  description  }  ')' 

value  : :=  'valueNot!'  individualvaluedPropertylD  individuallD  ')' 

I  'valueNot!'  individualvaluedPropertylD  individual  ')' 

|  'valueNot!'  datavaluedPropertylD  dataLiteral  ')' 

The  first  construct  is  simply  the  obvious  combination  of  a  DisjointClasses  of  all  the  descriptions  except  the  first  and  an 
EquivalentClasses  of  the  first  description  and  the  union  of  the  other  descriptions.  The  second  construct  is  the 
complementOf  the  restriction  of  the  property  to  the  value. 

2.2.  SROIQ 

OWL  1.1  includes  a  number  of  Description  Logic  constructs  that  raise  the  core  Description  Logic  expressive  power  from 
SHOIN  to  SROIQ  \SROIQ],  This  amounts  to  adding  qualified  cardinality  restrictions;  local  reflexivity  restrictions  for  simple 
properties;  reflexive,  irreflexive,  and  anti-symmetric  flags  for  simple  properties;  disjointness  of  simple  properties;  and 
regular  property  inclusion  axioms. 

dataCardinality  ::=  'minCardinality!'  non-negative-integer  dataRange  ')' 

|  'maxCardinality!'  non-negative-integer  dataRange  ')' 


http://www-db.research.bell-labs.com/user... 


3  of  5 


OWL  1 . 1  Web  Ontology  Language  Syntax 

|  ' cardinality! '  non-negative-integer  dataRange  ')' 

individualCardinality  : :=  'minCardinality! '  non-negative-integer  description  ')' 

I  ' maxCardinality! '  non-negative-integer  description  ')' 

|  ' cardinality! '  non-negative-integer  description  ')' 

individualRestrictionComponent  : :=  'self' 

individualvaluedPropertyFlags  : :=  'Reflexive'  |  ' Irreflexive '  |  'Symmetric'  |  'Antisymmetric' 

axiom  : :=  ' DisjointProperties ( '  dataavaluedPropertylD  datavaluedPropertylD  {  datavaluedPropertylD  }  ')' 

|  'DisjointProperties!'  individualvaluedPropertylD  individualvaluedPropertylD  {  individualvaluedPropertyl 

axiom  : :=  ' SubPropertyOf (propertyChain! '  individualvaluedPropertylD  {  individualvaluedPropertylD  }  ')' 

individualvaluedPropertylD  ')' 

Only  simple  properties  can  have  the  self  restriction  component.  Only  simple  properties  can  be  specified  as  being 
Reflexive,  Irreflexive,  Symmetric,  or  Antisymmetric.  Only  simple  properties  can  used  in  DisjointProperties  axioms  for 
individual-valued  properties. 

The  SubPropertyOf  axioms  involving  individual-valued  properties  must  be  regular.  That  is,  there  must  be  a  strict  partial 
order,  <,  on  individual-valued  properties  such  that  for  each  SubPropertyOf  axiom  involving  individual-valued  properties, 
of  the  form  SubPropertyOf!  S  R  ) 

1.  S  is  the  inverse  of  R, 

2.  S  is  of  the  form  propertyChain!  R  ...  R  ), 

3.  S  is  of  the  form  propertyChain!  SI  ...  Sn  )  and  each  Si  \lt;  R, 

4.  S  is  of  the  form  propertyChain!  R  SI  ...  Sn  )  and  each  Si  \lt;  R,  or 

5.  S  is  of  the  form  propertyChain!  SI  ...  Sn  R  )  and  each  Si  \lt;  R. 

The  meaning  of  all  these  constructs  is  the  same  as  in  SROIQ. 

2.3.  Datatypes 

OWL  1.1  includes  its  own  methods  for  user-defined  datatypes.  The  syntax  for  OWL  1.1  user-defined  datatypes  is  similar 
to  the  one  used  in  Protege.  The  semantics  for  OWL  1.1  user-defined  datatypes  is  taken  from  XML  Schema  Datatypes 
[XML  Schema  Datatypes 1. 

dataRange  : :=  'datatype!'  datatypelD  {  datatypeRestriction  }  ')' 
datatypeRestriction  : :=  datatypeFacet ' ( '  dataLiteral ' ) ' 

datatypeFacet  ::=  'length'  |  'minLength'  I  'maxLength'  |  'pattern'  |  'enumeration' 

I  ' maxlnclusive '  |  ' maxExclusive '  |  ' minlnclusive '  |  ' minExclusive ' 

|  'totalDigits '  |  ' f ractionDigits ' 

axiom  : :=  'Datatype!'  datatypelD  {  annotation  }  'base!'  datatypelD  ')'  {  datatypeRestriction  }  ')' 

Datatype  facets  should  only  by  used  where  they  would  be  allowed  in  XML  Schema  Datatypes  [XML  Schema  Datatypes 1, 
except  that  the  'length1,  'minLength1,  'maxLength1,  and  'pattern'  facets  are  not  allowed  for  numeric  types.  Datatype 
facets  have  the  same  meaning  as  in  XML  Schema  Datatypes,  except  that  they  uniformly  work  in  the  value  space,  never 
the  lexical  space.  If  a  datatype  facet  is  used  in  a  way  that  has  no  meaning,  such  as  (length  "5"~''xsd:string)  or 
(datatype  xsd:string  (maxlnclusive  "5"''''xsd:int))  then  the  datatype  extension  is  empty.  Note  that  this  means  that 
rdfs:Literal  has  no  useful  facets. 

OWL  1.1  allows  restrictions  that  relate  values  for  different  data-valued  properties  on  the  same  individual. 

restriction  : :=  'holds!'  datatypePredicatelD  {  argument  }  ')' 
restriction  : :=  datatypePropertylD  |  dataLiteral 

datatypePredicatesID  : :=  'equal'  |  'notEqual'  |  'lessThan'  |  ' lessThanEqual '  |  ' greaterThan '  |  ' greaterThanEqual ' 

The  syntax  here  allows  an  arbitrary  number  of  arguments,  but  must  be  appropriate  for  the  predicate,  and  all  the  current 
predicates  only  allow  two  arguments. 

The  base  types  of  the  values  being  compared  must  be  the  same,  although  this  may  be  liberalized  if  the  Semantic  Web 
Best  Practices  note  on  datatypes  is  generally  adopted.  All  invalid  combinations  are  unsatisfiable  (i.e.,  they  do  not  signal 
an  error).  The  equality  and  order  for  a  particular  base  type  is  taken  from  XML  Schema  Datatypes  [XML  Schema 
Datatypes],  If  a  base  datatype  does  not  have  an  order  then  the  restriction  is  unsatisfied. 

2.4.  Meta-modelling  and  annotations 

OWL  1.1  removes  the  limitation  on  names  being  only  one  of  a  class,  a  property,  or  an  individual  in  OWL  DL.  The 
semantic  change  to  allow  this  without  computational  consequences  is  to  break  the  RDF-inspired  connection  between 
class  and  property  extensions  and  the  individual  denotation  of  names. 

With  this  change,  non-annotation  properties  can  be  placed  on  any  name.  The  property  applies  to  the  use  of  the  name  as 
an  individual.  As  a  simple  syntactic  sugar,  non-annotation  properties  can  be  part  of  certain  class  and  property  axioms. 

OWL  1.1  changes  the  status  of  the  built-in  properties  rdfsdabel  and  rdfsicomment  from  annotation  properties  to 
data-valued  properties  with  no  domain  or  range. 

annotation  ::=  value 

|  'type!'  description  ')' 

A  class  or  property  axiom  with  an  annotation  is  syntactic  sugar  for  an  extra  Individual  axiom  for  the  name  with  just  the 
annotations. 
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2.5.  Comments 

OWL  1.1  allows  arbitrary  comments  to  be  inserted  in  ontologies. 

comment  : :=  ' Comment ('  {  commentEntry  }  ')' 
commentEntry  : :=  dataLiteral  |  URIreference 

A  comment  is  allowed  anywhere  white  space  is  allowed  and  acts  as  white  space. 

Comments  have  no  semantic  import  in  OWL  1.1,  but  systems  are  expected  to  keep  track  of  comments. 
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Abstract 

Many  legacy  ontologies  are  now  being  translated  into  Description 
Logic  (DL)  based  ontology  languages  in  order  to  take  advantage  of  DL 
based  tools  and  reasoning  services.  The  resulting  DL  Knowledge  Bases 
(KBs)  are  typically  of  large  size,  but  have  a  very  simple  structure,  i.e., 
they  consist  mainly  of  shallow  taxonomies.  The  classification  algorithms 
used  in  state-of-the-art  DL  reasoners  may  not  deal  well  with  such  KBs 
In  this  paper  we  propose  an  optimisation  which  dramatically  speeds-up 
classification  for  such  KBs. 


1  Introduction 

Motivated  by  the  W3C  recommendation,  and  the  availability  of  DL  based  tools 
and  reasoning  services,  many  legacy  ontologies  are  now  being  translated  into  the 
Description  Logic  (DL)  based  OWL  DL  ontology  language  [3].  The  resulting  DL 
Knowledge  Bases  (KBs)  are  typically  of  (very)  large  size,  but  often  have  only  a 
(very)  simple  structure.  In  particular,  the  sub-class  hierarchy  often  resembles  a 
taxonomy  (i.e.,  an  asserted  hierarchy  of  primitive  concepts),  and  is  very  broad 
and  shallow. 

The  classification  algorithms  used  in  state-of-the-art  DL  reasoners  (such  as 
FaCT  and  RACER)  may  not  deal  well  with  such  KBs,  mainly  due  to  the  fact  that 
some  classes  in  the  hierarchy  will  have  very  large  numbers  of  direct  sub-classes. 
For  such  KBs,  the  top-down  phase  of  the  standard  classification  algorithm  [2] 
will  perform  a  very  large  number  of  subsumption  tests,  almost  all  of  which  will 
fail  (i.e.,  conclude  that  no  subsumption  relationship  holds).  Although  the  simple 
structure  of  the  KB  means  that  the  time  required  for  each  such  test  is  small, 
the  very  large  number  of  tests  can  still  lead  to  performance  problems. 

In  this  paper  we  present  a  classification  optimisation  that  identifies  (subsets 
of)  KBs  for  which  it  is  possible  to  compute  the  concept  hierarchy  without  per¬ 
forming  any  subsumption  tests.  This  technique  is  very  effective  when  used  on 
the  kinds  of  legacy  KB  described  above  as  it  avoids  performing  large  numbers 


of  negative  subsumption  tests.  It  also  turns  out  to  produce  (smaller)  improve¬ 
ments  in  classification  times  for  some  more  complex  ontologies,  as  even  in  these 
ontologies  a  significant  part  of  the  upper  level  structure  often  resembles  a  tax¬ 
onomy. 

The  drawback  of  this  new  optimisation  is  that  it  is  only  applicable  to  purely 
“definitional”  knowledge  bases,  i.e. ,  those  that  do  not  contain  any  general  con¬ 
cept  inclusion  axioms  (GCIs).  However,  the  well  known  absorption  optimisation 
has  already  been  shown  to  be  able  to  eliminate  GCIs  from  typical  ontologies  [6] 
and,  moreover,  when  absorption  fails  to  eliminate  GCIs,  we  may  already  expect 
serious  reasoning  performance  problems  from  other  sources  (i.e.,  the  hardness 
of  individual  subsumption  tests).  It  is  also  only  applicable  to  that  portion  of 
a  KB  which  has  the  necessary  simple  structure,  but  this  is  often  a  major  part 
of  legacy  KBs  and,  as  mentioned  above,  is  usually  a  significant  part  of  more 
complex  KBs. 

2  Preliminaries 

Description  Logics  are  concept  (class)  based  knowledge  representation  systems. 
They  are  (usually)  decidable  fragments  of  First  Order  Predicate  Calculus,  and 
have  a  have  standard  first  order  style  model  theoretic  semantics  [1] .  The  formal 
specification  of  semantics  coupled  with  decidability  allows  for  the  design  of  deci¬ 
sion  procedures  (sound,  complete  and  terminating  algorithms)  for  key  reasoning 
tasks  such  as  concept  subsumption. 

A  DL  Knowledge  Base  is  often  thought  of  as  consisting  of  two  parts:  a  Tbox 
and  an  Abox.  The  Tbox  consists  of  a  set  of  axioms  that  describe  constraints  on 
instances  of  given  concepts  (roughly  akin  to  a  conceptual  schema  in  a  database 
setting);  the  Abox  consists  of  a  set  of  axioms  that  assert  instance  relationships 
between  individuals  and  concepts,  and  role  relationships  between  pairs  of  indi¬ 
viduals  (roughly  akin  to  data  in  a  database  setting).  Tbox  axioms  are  of  the 
form  C  C  D  or  C  =  D,  where  C  and  D  are  concepts.  When  C  is  a  concept 
name,  such  an  axiom  is  often  called  a  definition  (of  C),  and  when  a  definition 
is  of  the  form  C  C  D,  C  is  called  a  primitive  concept. 

Classification  of  a  Tbox  is  the  task  of  computing  and  caching  the  concept 
hierarchy  for  all  of  the  named  concepts  that  occur  in  the  Tbox,  i.e.,  computing 
the  subsumption  partial  ordering  of  the  named  concepts.1  Tbox  classification  is  a 
basic  reasoning  task  for  DL  reasoners — the  concept  hierarchy  may  be  interesting 
in  its  own  right,  and  is  used  to  optimise  many  other  reasoning  tasks  (e.g.,  query 
answering).  This  latter  point  is  particularly  important  as,  even  if  the  Tbox  part 
of  a  KB  is  very  simple,  the  Abox  may  describe  instances  of  complex  concept 
expressions.  This  is  typical,  e.g.,  for  applications  of  the  Gene  Ontology  [4], 

We  will  first  briefly  recall  the  optimised  procedure  for  computing  the  concept 

1We  assume  w.l.o.g.  that  all  concept  names  occurring  in  the  Abox  also  occur  in  the  Tbox; 
this  can  be  achieved  by  adding  axioms  of  the  form  C  C  T  to  the  Tbox  for  any  concept  name 
C  that  would  otherwise  occur  only  in  the  Abox. 


hierarchy  first  described  in  [2] .  In  this  procedure,  all  concept  names  are  sorted 
into  definitional  order,  i.e.  if  concept  name  D  occurs  in  the  definition  of  concept 
name  C,  then  D  <  C.2  The  concept  hierarchy  is  initialised  to  contain  the  two 
concepts  T  and  _L  (with  T  being  a  super  concept  of  _L),  and  the  named  concepts 
are  classified  (added  to  the  hierarchy  in  the  appropriate  position)  one  at  a  time 
in  definitional  order. 

Classifying  a  concept  involves  two  phases:  a  top-down  phase  in  which  its  par¬ 
ents  (i.e.,  direct  subsumers)  are  computed,  and  a  bottom-up  phase  in  which  its 
children  (i.e.,  direct  subsumees)  are  computed.  In  many  cases,  adding  concepts 
in  definitional  order  may  allow  the  bottom-up  phase  to  be  omitted  (because 
when  a  new  concept  is  classified  its  only  child  will  be  _L). 

Various  optimisations  are  used  in  order  to  minimise  the  number  of  subsump¬ 
tion  tests  needed  in  each  phase.  For  example,  when  adding  a  concept  C  to  the 
hierarchy,  a  top-down  breadth  first  traversal  is  used  that  only  checks  if  D  sub¬ 
sumes  C  when  it  has  already  been  determined  that  C  is  subsumed  by  all  the 
concepts  in  the  hierarchy  that  subsume  D.  The  structure  of  Tbox  axioms  is  also 
used  to  compute  a  set  of  told  subsumers  of  C  (i.e.,  trivially  obvious  subsumers). 
For  example,  if  the  Tbox  contains  an  axiom  C  C  D\  n  D-2,  then  both  D\  and 
D2,  as  well  as  all  their  told  subsumers,  are  told  subsumers  of  C.  As  subsump¬ 
tion  is  immediate  for  told  subsumers,  no  tests  need  to  be  performed  w.r.t.  these 
concepts.  In  order  to  maximise  the  benefit  of  this  optimisation,  all  of  the  told 
subsumers  of  a  concept  C  are  classified  before  C  itself  is  classified. 

The  told  subsumer  optimisation  can  be  used  to  approximate  the  position  of 
C  in  the  hierarchy:  all  of  its  told  subsumers,  and  any  super-concepts  of  these 
told-subsumers,  can  be  marked  as  subsumers  of  C.  The  most  specific  concepts 
in  this  set  of  marked  concepts  are  then  candidates  to  be  parents  of  C .  In  the 
standard  algorithm,  however,  it  is  necessary  to  check  (recursively)  if  the  children 
of  these  concepts  are  also  subsumers  of  C.  When  it  has  been  determined  for 
some  subsumer  D  of  C  that  none  of  the  children  of  D  subsume  C ,  then  we  know 
that  D  is  a  parent  of  C . 

At  the  end  of  the  top-down  phase  we  will  have  computed  the  set  of  parents 
of  C\  all  of  the  concepts  in  this  set,  along  with  all  their  super-concepts,  are 
subsumers  of  (7;  all  other  concepts  are  non-subsumers  of  C.  The  next  step 
is  to  determine  the  set  of  children  of  C  (as  mentioned  above,  this  step  can  be 
omitted  for  a  primitive  concept  when  concepts  have  been  classified  in  definitional 
order  [2]).  This  phase  is  very  similar  to  (the  reverse  of)  the  top-down  one,  and 
as  our  optimisation  only  relates  to  the  top-down  phase  we  won’t  describe  it 
here — interested  readers  can  refer  to  [1]  for  full  details. 

For  large  and  shallow  taxonomies,  a  concept  D  may  have  hundreds  or  even 
thousands  of  children.  If,  when  classifying  a  concept  C,  one  of  its  told  subsumers 
is  D,  then  the  above  algorithm  may  lead  to  all  of  the  other  children  of  D  being 

2In  the  FaCT+- (-  implementation  we  actually  use  quasi- definitional  order ,  as  proposed  in  [5], 
but  to  simplify  the  presentation  we  will  assume  that  definitional  order  is  used. 


checked  to  see  if  they  subsume  C.  Although  the  time  taken  for  each  such  test 
may  be  small,  the  cumulative  cost  of  all  these  tests  may  be  prohibitive  when 
classifying  such  a  Tbox.  Moreover,  in  many  cases  all  of  these  tests  will  be 
negative  (i.e. ,  no  subsumption  relationship  will  be  found),  and  might  be  thought 
of  as  somehow  “wasted”.  The  objective  of  our  optimisation  is  to  avoid  these 
“wasted”  tests. 

3  Completely  Defined  Concepts 

Given  a  Tbox  T,  a  primitive  concept  C  is  said  to  be  completely  defined  w.r.t. 
T  when,  for  the  definition  C  C  C\  n  . . .  n  Cn  in  T,  it  holds  that: 

1.  For  all  1  ^  i  ^  n,  Ct  is  a  primitive  concept. 

2.  (minimality)  There  exist  no  i  j  such  that  1  <  i,  j  <  n  and  C\  Cj\~\. . .. 

When  the  Tbox  is  obvious  from  the  context  we  will  talk  about  completely  defined 
concepts  without  reference  to  the  Tbox. 

If  we  assume  a  cycle-free  Tbox  containing  only  CD  concepts  and  no  GGIs, 
then  the  classification  process  is  very  simple.  In  fact,  we  don’t  need  to  perform 
any  subsumption  tests  at  all:  the  position  of  every  concept  in  the  hierarchy  is 
completely  defined  by  its  told  subsumers.  If  concepts  are  processed  in  definitional 
order,  then  when  a  concept  C  is  classified,  where  C  is  defined  by  the  axiom 
C  U  C\  n  . . .  n  Cn,  the  parents  of  C  are  C\. ,  Cn .  and  the  only  child  of  C  is 
_L.  Note  that  every  concept  in  such  a  taxonomy  is  satisfiable,  because  there  is 
no  use  of  negation. 

The  following  theorem  is  straightforward: 

Theorem  1  If  a  cycle-free  KB  contains  only  completely -defined  concepts  and 
no  general  axioms,  then  the  taxonomy  built  by  the  above  method  will  be  correct. 

This  theorem  is,  however,  of  very  little  practical  value  due  to  the  very  strin¬ 
gent  conditions  on  the  structure  of  the  Tbox.  In  the  following  we  will  show 
how  the  basic  technique  can  be  made  more  useful  by  weakening  some  of  these 
conditions. 

Primitivity.  In  general,  a  CD  concept  should  not  have  non-primitive  concepts 
in  its  definition.  This  is  because,  when  the  hierarchy  already  includes  non¬ 
primitive  concepts  (which  will  be  the  case  given  definitional  order  classification) , 
the  bottom-up  phase  can  not  be  omitted,  and  the  CD  method  could  therefore 
lead  to  incorrect  results.  Assume,  e.g.,  a  TBox 

{C  c  cx  n  C2  n  C3,  C  =  Cx  n  C2}.  (l) 

Using  the  CD  classification  approach,  C  will  be  classified  under  C\,C2,C3, 
whereas  it  should  be  classified  under  C'  and  C3. 

One  case  in  which  this  condition  can  be  weakened  is  for  synonyms.  A  non¬ 
primitive  concept  C  is  a  synonym  if  it’s  definition  is  of  the  form  C  —  D,  where 


D  is  a  primitive  concept.  Synonyms  may  came  from  an  application  domain,  or 
occur  as  a  result  of  KB  simplification,  KB  merging,  etc. 

It  is  easy  to  see  that  synonyms  don’t  require  special  classification:  once  D  is 
classified,  C  will  take  the  same  place  in  the  hierarchy.  So,  adding  synonyms  to 
the  CD-only  KB  still  allows  application  of  the  CD  approach. 

Minimality.  Non-minima, 1  concepts  may  occur  as  a  result  of  badly  designed 
ontologies  and/or  due  to  absorption  of  GCIs.  The  minimality  check  may,  how¬ 
ever,  be  removed  from  the  definition  of  CD  concepts  in  the  classification  algo¬ 
rithm.  Indeed,  checking  if  each  Ct  in  a  definition  C  C  C\  n  . . .  n  Cn  is  really  a 
parent  of  C  (i.e.,  has  no  children  that  are  subsumers  of  C )  is  exactly  the  check 
that  is  needed  in  order  to  detect  non-minimality.  This  check  is  relatively  cheap 
and  already  exists  in  the  classification  algorithm. 

Non-CD  concepts.  This  is  the  most  important  case,  because  “interesting” 
ontologies,  including  most  ontologies  designed  using  DL  based  languages,  will 
contain  concept  constructors  other  than  conjunction,  and  this  will  lead  to  some 
concepts  being  non-CD.  This  means  that,  in  its  current  form,  the  CD  approach 
will  usually  be  largely  useless.  On  the  other  hand,  almost  all  KBs  will  contain 
some  CD  concepts.  In  this  case,  it  may  be  possible  to  split  the  Tbox  into  two 
parts — a  CD  part  (i.e.,  containing  only  CD  concepts)  and  a  non-CD  part — and 
use  the  CD  algorithm  only  for  the  CD  part. 

Note  that  such  a  split  will  not  introduce  any  problems  if  the  CD  part  of 
the  classification  is  performed  first — in  fact  the  classification  of  the  CD  part 
is  independent  of  the  non-CD  part  of  the  Tbox  because  the  definitions  of  CD 
concepts  only  refer  to  other  CD  concepts.  In  the  Tbox  1  above,  for  example, 
concepts  Ci,  C2,  C3  and  C  will  be  in  the  CD  part,  and  C'  in  the  non-CD  part. 
After  CD-classihcation  C  will  have  3  parents,  and  standard  algorithm  then  insert 
C  with  parents  C) ,  C'2  and  child  C. 

Cycles.  We  will  distinguish  two  kinds  of  definitional  cycles.  The  first  (and 
simplest)  is  a  cycle  via  concept  names,  as  in  the  Tbox  K  —  { A  C  B  n  C,  B  C 
A}.  This  kind  of  cycle  can  be  detected  syntactically  and  transformed  into  an 
equivalent  definition  K'  —  {A  C  C,  B  =  A}  where  A  is  a  CD  concept  and  B  is 
a  synonym  of  A. 

Any  other  kind  of  terminological  cycle  must  involve  non-CD  concepts,  and 
so  must  occur  in  the  non-CD  part  of  TBox.  In  this  case  it  will  be  dealt  with  in 
the  normal  way  by  the  standard  classification  algorithm. 

General  axioms.  GCIs  are  axioms  of  the  form  CCD,  where  C  and  1)  are 
arbitrary  concept  expressions.3  It  is  easy  to  see  that,  in  the  general  case,  the 
CD  approach  cannot  be  used  in  the  presence  of  GCIs.  Consider,  for  example,  a 
Tbox  K  =  {CC  T,  T  CD}.  In  this  case,  the  CD  algorithm  classifies  C  under 
T,  whereas  it  should  be  classified  under  D. 

3 Note  that,  in  case  there  are  multiple  axioms  of  the  form  C  C  D  or  C  =  D  for  some 
concept  name  C,  then  only  one  of  these  can  be  considered  the  definition  of  C,  and  the  rest 
must  be  treated  as  GCIs  (or,  in  the  case  of  C  =  D,  as  a  pair  of  GCIs  C  C  D  and  D  C.  C). 


Fortunately,  most  realistic  KBs  contain  only  general  axioms  that  can  be 
absorbed  into  either  concept  implications  [6]  or  role  domain  restrictions  [8],  and 
in  this  case  the  CD  approach  is  still  applicable. 

4  Two-stage  Approach  Using  CD. 

The  two-stage  CD  classification  algorithm  has  been  implemented  in  our  FaCTf- f- 
reasoner  as  follows.  First  of  all,  the  following  transformations  are  performed 
on  the  Tbox  (only  transformations  relevant  to  the  classification  algorithm  are 
mentioned  here): 

1.  Absorb  general  axioms  into  concept  definitions  and/or  role  domains.  If 
some  of  the  axioms  are  not  absorbable,  set  useCD  to  false.  If  all  the 
axioms  were  absorbed,  set  useCD  to  true. 

2.  Transform  simple  cycles  into  sets  of  synonyms. 

3.  If  useCD  is  true,  mark  some  concepts  as  CD.  Namely,  T  is  marked  as 
CD;  a  primitive  concept  C  is  marked  as  CD  iff  it  has  the  definition  C  C 
C'l  n  . . .  FI  Cn  and  every  C*  is  marked  CD;  a  non-primitive  concept  D  is 
marked  CD  iff  it  has  definition  D  —  C  and  C  is  marked  CD. 

If  useCD  is  true  after  the  preprocessing,  the  CD  classifier  is  run  prior  to 
the  general  classifier.  The  CD  classifier  works  on  concepts  that  are  marked  CD, 
processing  them  in  definitional  order.  For  each  such  concept  C,  the  steps  it 
performs  are  as  follows: 

1.  If  C  is  a  synonym  of  some  already  classified  concept  D,  then  insert  C  at 
the  same  place  as  D. 

2.  For  CD  C  with  definition  C  K  C\  n  . . .  n  Cn,  concepts  C\  ,  Cn  are 
candidates  to  be  parents  of  C . 

3.  For  every  candidate  6/,  check  whether  it  is  redundant,  i.e.  whether  (7; 
has  a  child  that  is  an  ancestor  of  a  C .  This  can  be  done  by  labelling  all 
ancestors  of  candidate  concepts:  labelled  candidates  will  be  redundant. 
Remove  redundant  candidates  from  the  list  of  candidates. 

4.  Insert  C  into  the  taxonomy  with  the  remaining  candidates  as  parents  and 
_L  as  the  only  child. 

Then  the  rest  of  the  ontology  is  then  classified  using  the  standard  classifica¬ 
tion  algorithm. 

We  have  tested  our  implementation  using  several  KBs:  NCI  is  the  National 
Cancer  Research  Institute  ontology;  GO  is  the  Gene  Ontology  from  the  Gene 
Ontology  Consortium;  GALEN  is  the  anatomical  part  of  the  well-known  medi¬ 
cal  terminology  ontology  [7].  Details  of  KB  characteristics  are  given  in  Table  1, 


KB 

PConcepts 

nCD 

NConcepts 

Synonyms 

NCI 

27652 

15195 

0 

0 

GO 

13926 

11718 

3 

0 

GALEN 

2048 

546 

699 

18 

Table  1:  test  KB  properties 


KB 

CD 

time 

nOps 

nTests 

nCache 

NCI 

no 

76.40 

1,614,903 

0 

10,311,489 

yes 

3.61 

1,012,281 

0 

766,054 

GO 

no 

7.40 

835,194 

30,834 

5,184,070 

yes 

3.67 

783,024 

29,768 

1,432,892 

GALEN 

no 

204.70 

67,524,538 

25,660 

82,962 

yes 

204.54 

68,032,698 

25,722 

43,043 

Table  2:  test  IvB  results 


where  PConcepts  is  the  number  of  primitive  concepts,  nCD  the  number  of  com¬ 
pletely  defined  concepts,  NConcepts  the  number  of  non-primitive  concepts,  and 
Synonyms  the  number  of  synonyms.  All  experiments  used  v. 0.99. 4  of  FaCT-H F 
running  under  Linux  on  an  Athlon  1.3GHz  machine  with  768Mb  of  memory. 

The  results  of  the  classification  tests  are  given  in  Table  2,  where  time  is  the 
time  taken  to  classify  the  KB  (in  seconds),  nOps  is  the  number  of  expansion 
rule  applications  during  the  classification  process,  nTests  is  the  number  of  sub¬ 
sumption  tests,  and  nCache  is  the  number  of  subsumptions  that  were  computed 
using  cached  models  [6]. 

Using  CD  speeds  up  the  classification  of  NCI  by  a  factor  of  more  than  20.  In 
both  cases,  all  subsumption  tests  are  solved  cheaply  using  cached  models,  but 
more  than  ten  million  tests  are  performed  when  CD  is  not  used;  employing  CD 
reduces  this  number  to  less  than  one  million.  Classification  of  GO  is  twice  as  fast 
with  CD  than  without  it.  Again,  GO  has  a  simple  structure,  but  is  very  broad, 
so  CD  still  gives  a  significant  reduction  in  the  large  number  of  cache  based  tests. 
GALEN  behaves  differently.  It  is  the  only  KB  where  more  “real”  (non  cache 
based)  subsumption  tests  are  performed  with  CD  than  without.  This  is  due  to 
the  large  number  of  non-primitive  concepts  that  are  classified  in  the  middle  of 
the  hierarchy.  Even  in  this  case,  however,  saving  large  numbers  of  cache  based 
tests  leads  to  a  slightly  smaller  overall  classification  time. 

5  Discussion 

The  proposed  classification  technique  is  applicable  to  a  large  number  of  real- 
life  ontologies,  i.e. ,  those  where  there  are  no  non-absorbed  GCIs.  The  best 
results  are,  of  course,  for  ontologies  with  large  numbers  of  primitive  concepts 
and  a  simple  structure,  but  even  in  cases  where  it  has  little  beneficial  effect,  it 
does  not  appear  to  have  any  detrimental  one.  The  number  of  such  ontologies 
may  decrease,  because  newly  created  ontologies  will  (probably)  use  more  of 


the  expressive  possibilities  provided  by  modern  DLs.  With  legacy  ontologies, 
however,  the  method  may  prove  to  be  very  useful. 

In  [5]  the  so-called  bucket  method  was  proposed  as  a  way  to  deal  with  broad 
and  shallow  hierarchies.  In  this  method,  when  some  concept  in  the  hierarchy 
is  found  to  have  a  large  number  of  children,  a  new  “virtual”  concept  is  added 
to  the  hierarchy;  this  non-primitive  concept  is  defined  to  be  equivalent  to  the 
disjunction  of  some  of  the  children  of  the  original  concept,  and  is  used  in  fast 
cache-based  comparison. 

It  is  possible  to  use  the  bucket  method  at  the  second  stage  of  the  CD  clas¬ 
sification  algorithm.  However,  this  method  will  not  improve  first  stage  of  the 
algorithm,  since  no  search  is  actually  performed  there. 


References 

[1]  F.  Baader,  D.  Calvanese,  D.  McGuinness,  D.  Nardi,  and  P.  F.  Patel- 
Schneider,  editors.  The  Description  Logic  Handbook:  Theory,  Implemen¬ 
tation  and  Applications.  Cambridge  University  Press,  2003. 

[2]  F.  Baader,  E.  Franconi,  B.  Hollunder,  B.  Nebel,  and  H.-J.  Profitlicli.  An 
empirical  analysis  of  optimization  techniques  for  terminological  representa¬ 
tion  systems  or:  Making  KRIS  get  a  move  on.  Applied  Artificial  Intelligence. 
Special  Issue  on  Knowledge  Base  Management ,  4:109-132,  1994. 

[3]  S.  Bechhofer,  F.  van  Harmelen,  J.  Hendler,  I.  Horrocks,  D.  L.  McGuin¬ 
ness,  P.  F.  Patel-Schneider,  and  L.  A.  Stein.  OWL  web  ontology  lan¬ 
guage  reference.  W3C  Recommendation,  10  February  2004.  Available  at 

http : //www. w3 . org/TR/owl-ref /. 

[4]  GOA  project.  European  Bioinformatics  Institute,  http://www.ebi.ac.uk/ 
GOA/. 

[5]  V.  Haarslev  and  R.  Moller.  High  performance  reasoning  with  very  large 
knowledge  bases:  A  practical  case  study.  In  Proc.  of  the  1 7th  Int.  Joint 
Conf.  on  Artificial  Intelligence  (IJCAI  2001),  pages  161-168,  2001. 

[6]  I.  Horrocks.  Using  an  expressive  description  logic:  FaCT  or  fiction?  In 
Proc.  of  the  6th  Int.  Conf.  on  Principles  of  Knowledge  Representation  and 
Reasoning  (KR’98),  pages  636-647,  1998. 

[7]  J.  E.  Rogers,  A.  Roberts,  W.  D.  Solomon,  E.  van  der  Haring,  C.  J.  Wroe, 
P.  E.  Zanstra,  and  A.  L.  Rector.  GALEN  ten  years  on:  Tasks  and  supporting 
tools.  In  Proc.  of  MEDINFO2001,  pages  256-260,  2001. 

[8]  D.  Tsarkov  and  I.  Horrocks.  Efficient  reasoning  with  range  and  domain 
constraints.  In  Proc.  of  the  2001  Description  Logic  Workshop  (DL  2007), 
pages  41-50,  2004. 


HR001 1-05-C-0094 


Lucent  Technologies 

Bell  Labs  Innovations 


Appendix  G: 

"Ordering  heuristics  for  description  logic  reasoning"  by  Dmitry  Tsarkov  and  Ian  Horrocks,  in 
Proceedings  of  the  Nineteenth  International  Joint  Conference  on  Artificial  Intelligence  (IJCAI 
2005),  Edinburgh,  Scotland,  August  2005.  Enclosed  with  the  full  electronic  version  of  this 
report  as  the  file  iicai2005 -fppinternals  .pdf. 


15 


Ordering  Heuristics  for  Description  Logic  Reasoning 


Dmitry  Tsarkov  and  Ian  Horrocks 

School  of  Computer  Science 
University  of  Manchester 
Manchester,  UK 
traskovlhorrocks@cs.man.ac.uk 


Abstract 

We  present  a  new  architecture  for  Description 
Logic  implementations,  a  range  of  new  optimisa¬ 
tion  techniques  and  an  empirical  analysis  of  their 
effectiveness. 

1  Introduction 

Description  Logics  (DLs)  are  a  family  of  logic  based  knowl¬ 
edge  representation  formalisms.  Although  they  have  a 
range  of  applications  (e.g.,  configuration  [McGuinness& 
Wright,  1998],  and  reasoning  with  database  schemas  and 
queries  [Calvanese  et  al,  1998b;  1998a]),  they  are  per¬ 
haps  best  known  as  the  basis  for  widely  used  ontology  lan¬ 
guages  such  as  OIL,  DAML+OIL  and  OWL  [Horrocks  et 
al,  2003].  As  well  as  DLs  providing  the  formal  under¬ 
pinnings  for  these  languages  (i.e.,  a  declarative  semantics), 
DL  systems  are  also  used  to  provide  computational  services 
for  ontology  tools  and  applications  [Knublauch  et  al,  2004; 
Rectore,  2003], 

Most  modern  DL  systems  are  based  on  tableaux  algo¬ 
rithms.  Such  algorithms  were  first  introduced  by  Schmidt- 
SchauB  and  Smolka  [Schmidt-SchauB  &  Smolka,  1991].  and 
subsequently  extended  to  deal  with  ever  more  expressive  log¬ 
ics  [Baader  et  al,  2003].  Many  systems  now  implement  the 
STilQ  DL,  a  tableaux  algorithm  for  which  was  first  pre¬ 
sented  in  [Horrocks  et  al,  1999];  this  logic  is  very  expres¬ 
sive,  and  corresponds  closely  to  the  OWL  ontology  language. 
In  spite  of  the  high  worst  case  complexity  of  the  satisfiabil¬ 
ity/subsumption  problem  for  this  logic  (ExpTime-complete), 
highly  optimised  implementations  have  been  shown  to  work 
well  in  many  realistic  (ontology)  applications  [Horrocks, 
1998]. 

Optimisation  is  crucial  to  the  viability  of  tableaux  based 
systems:  in  experiments  using  both  artificial  test  data  and 
application  ontologies,  (relatively)  unoptimised  systems  per¬ 
formed  very  badly,  often  being  (at  least)  several  orders  of 
magnitude  slower  than  optimised  systems;  in  many  cases, 
hours  of  processing  time  (in  some  cases  even  hundreds  of 
hours)  proved  insufficient  for  unoptimised  systems  to  solve 
problems  that  took  only  a  few  milliseconds  for  an  optimised 
system  [Massacci.  1999;  Horrocks  &  Patel-Schneider,  1998]. 
Modern  systems  typically  employ  a  wide  range  of  optimisa¬ 
tions,  including  (at  least)  those  described  in  [Baader  et  al, 
1994;  Horrocks  &  Patel-Schneider,  1999]. 

Tableaux  algorithms  try  to  construct  a  graph  (usually  a 
tree)  representation  of  a  model  of  a  concept,  the  structure  of 
which  is  determined  by  syntactic  decomposition  of  the  con¬ 
cept.  Most  implementations  employ  a  space  saving  optimisa¬ 


tion  known  as  the  trace  technique  that  uses  a  top-down  con¬ 
struction  requiring  (for  PSpace  logics)  only  polynomial  space 
in  order  to  delineate  a  tree  structure  that  may  be  exponential 
in  size  (with  respect  to  the  size  of  the  input  concept).  For 
the  ExpTime  logics  implemented  in  modern  systems,  how¬ 
ever,  guaranteeing  polynomial  space  usage  is  no  longer  an 
option.  Moreover,  for  logics  that  support  inverse  roles  (such 
as  STCIQ),  a  strictly  top  down  approach  is  no  longer  possible 
as  constraints  may  be  propagated  both  "up”  and  “down"  the 
edges  in  the  tree. 

We  describe  an  alternative  architecture  for  tableaux  imple¬ 
mentations  that  uses  a  (set  of)  queue(s)  instead  of  (an  adap¬ 
tion  of)  the  standard  top-down  approach.  This  architecture, 
which  we  have  implemented  in  our  new  FaCT++  system,  has 
a  number  of  advantages  when  compared  to  the  top-down  ap¬ 
proach.  Firstly,  it  is  applicable  to  a  much  wider  range  of  log¬ 
ics,  including  the  expressive  logics  implemented  in  modern 
systems,  because  it  makes  no  assumptions  about  the  structure 
of  the  graph  (in  particular,  whether  tree  shaped  or  not),  or  the 
order  in  which  the  graph  will  be  constructed.  Secondly,  it 
allows  for  the  use  of  more  powerful  heuristics  that  try  to  im¬ 
prove  typical  case  performance  by  varying  the  global  order 
in  which  different  syntactic  structures  are  decomposed;  in  a 
top-down  construction,  such  heuristics  can  only  operate  on  a 
local  region  of  the  graph — typically  a  single  vertex. 

2  Preliminaries 

We  present  here  a  brief  introduction  to  DL  (in  particular 
STCIQ)  syntax,  semantics  and  reasoning;  for  further  details 
the  reader  is  referred  to  [Baader  et  al,  2003]. 

2.1  Description  Logics 

Syntax  Let  R  be  a  set  of  role  names  with  both  transitive  and 
normal  role  names  R+  URp  =  R,  where  R+  fiRp  =  0.  The 
set  of  STLIQ-roles  (or  roles  for  short)  is  Ru{k“  |  R  6  R}. 
Let  Nc  be  a  set  of  concept  names.  The  set  of  STCIQ- 
concepts  (or  concepts  for  short)  is  the  smallest  set  such  that 
every  concept  name  C  6  Nc  is  a  concept,  and  if  C  and  D 
are  concepts,  R  is  a  role,  S'  is  a  simple  role1  and  n  €  IN,  then 
(C  n  D),  (C  U  D),  (-,<7),  (MR.C),  (3 R.C),  (< nR.C )  and 

nR.C )  are  also  concepts;  the  last  four  are  called,  respec¬ 
tively,  value,  exists,  atmost  and  atleast  restrictions. 

For  R  and  S  (possibly  inverse)  roles,  R  Q  S  is  called  a 
role  inclusion  axiom,  and  a  finite  set  of  role  inclusion  axioms 
is  called  a  role  hierarchy.  For  C  and  D  (possibly  complex) 

*A  simple  role  is  one  that  is  neither  transitive  nor  has  any  tran¬ 
sitive  subroles.  Restricting  number  restrictions  to  simple  roles  is 
required  for  decidability  [Horrocks  et  al,  1999]. 


concepts,  C  C  D  is  called  a  general  concept  inclusion  (GCI), 
and  a  Unite  set  of  GCIs  is  called  a  TBox. 

Semantics  An  interpretation  X  =  (A2,  -2)  consists  of  a  non¬ 
empty  set  A1,  the  domain  of  X,  and  a  function  -2  which  maps 
every  role  to  a  subset  of  A1  x  A2  such  that,  for  P  E  R  and 
R  E  R+,  {x,y)  E  P1  iff  (y,x)  E  P~X ,  and  if  {x,y)  E  R1 
and  (y.  z)  E  R1  then  (x,  z )  6  R1 .  The  interpretation  func¬ 
tion  -2  of  an  interpretation  X  =  (A2.  -2)  maps,  additionally, 
every  concept  to  a  subset  of  A2  such  that 

(C  n  D)1  =  C1  n  D1,  (C  U  D)1  =  C1  U  Dx, 
^CX=AT\CI, 

(3R.C)1  =  {x  E  A1  RI(x1  C)  £  0}, 

(VR.C)2  =  {x  €  A1  RI(x,  ~<C)  =  0}, 

(< nR.C )2  =  {x  6  A2  |  )j RT(x,C)  ^  n},  and 
(i^nR-C)1  =  {x  €  A2  |  ttJ?2(x,(7)  ^  n}, 

where  )JM  is  the  cardinality  of  a  set  M  and  _R2(x,  C)  is  de¬ 
fined  as  {y  |  (x,  y)  E  R1  and  y  E  C1}. 

An  interpretation  X  satisfies  a  role  hierarchy  TZ  iff  R1  c  S1 
for  each  R  C.  S  ETZ,  and  it  satisfies  a  TBox  T  iff  C1  C  D1 
for  each  CCTgT;  such  an  interpretation  is  called  a  model 
of  TZ  and  T . 

A  concept  C  is  satisfiable  w.r.t.  a  role  hierarchy  TZ  and  a 
TBox  T  iff  there  is  a  model  X  of  TZ  and  T  with  C1  0. 
Such  an  interpretation  is  called  a  model  of  C  w.r.t.  1Z  and  T. 
As  usual  for  expressive  DLs,  subsumption  can  be  reduced  to 
satisfiability,  and  reasoning  w.r.t.  a  TBox  and  role  hierarchy 
can  be  reduced  to  reasoning  w.r.t.  a  role  hierarchy  only  [Hor- 
rocks  et  ah,  1999]. 

2.2  Tableaux  Algorithms 

The  basic  idea  behind  a  tableau  algorithm  is  to  take  an  in¬ 
put  concept  C  and  role  hierarchy  TZ,  and  to  try  to  prove  the 
satisfiability  of  C  w.r.t.  XZ  by  constructing  a  model  X  of  C 
w.r.t.  TZ.  This  is  done  by  syntactically  decomposing  C  so  as 
to  derive  constraints  on  the  structure  of  such  a  model.  For 
example,  any  model  of  C  must,  by  definition,  contain  some 
individual  x  such  that  x  is  an  element  of  Cx ,  and  if  C  is  of 
the  form  3 R.D,  then  the  model  must  also  contain  an  individ¬ 
ual  y  such  that  (x,  y)  E  R1  and  y  is  an  element  of  D1;  if  D 
is  non-atomic,  then  continuing  with  the  decomposition  of  D 
would  lead  to  additional  constraints.  The  construction  fails 
if  the  constraints  include  a  clash  (an  obvious  contradiction), 
e.g.,  if  some  individual  a  must  be  an  element  of  both  C  and 
-i C  for  some  concept  C.  Algorithms  are  normally  designed 
so  that  they  are  guaranteed  to  terminate,  and  guaranteed  to 
construct  a  model  if  one  exists;  such  an  algorithm  is  clearly  a 
decision  procedure  for  concept  satisfiability. 

In  practice,  algorithms  often  work  on  a  tree  shaped  graph 
that  has  a  close  correspondence  to  a  model;  this  may  be  be¬ 
cause,  e.g.,  models  could  be  non-finite  (although  obviously 
finitely  representable),  or  non-trees  (although  usually  tree¬ 
like).  Typically  this  will  be  a  labelled  graph  (usually  a  tree 
or  collection  of  trees)  where  nodes  represent  individuals  in 
the  model,  and  are  labelled  with  a  set  of  concepts  of  which 
they  are  instances,  and  edges  represent  role  relationships  be¬ 
tween  pairs  of  individuals,  and  are  labelled  with  a  set  of  role 
names. 

The  decomposition  and  construction  is  usually  carried  out 
by  applying  so  called  tableaux  expansion  rules  to  the  concepts 
in  node  labels,  with  one  rule  being  defined  for  each  of  the 
syntactic  constructs  in  the  language  (with  the  exception  of 


negation,  which  is  pushed  inwards  using  re-writings  such  as 
de  Morgan’s  laws,  until  it  applies  only  to  atomic  concepts). 
For  example,  the  expansion  rule  for  conjunction  causes  C  and 
D  to  be  added  to  any  node  label  already  containing  C 13  D  (in 
order  to  guarantee  termination,  side  conditions  prevent  rules 
from  being  applied  if  they  do  not  change  either  the  graph  or 
its  labelling). 

There  are  two  forms  of  non-determinism  in  the  expansion 
procedure.  In  the  first  place,  many  rules  may  be  simultane¬ 
ously  applicable,  and  some  order  of  rule  applications  must  be 
chosen.  From  a  correctness  perspective,  this  choice  is  usually 
irrelevant2  (because,  if  there  is  a  model,  then  it  will  be  found 
by  any  expansion  ordering),  but  as  we  will  see  later,  the  order 
of  expansion  can  have  a  big  effect  on  efficiency.  In  the  second 
place,  some  rules  expand  the  graph  non-deterministically; 
e.g.,  the  expansion  rule  for  disjunction  causes  either  C  or 
D  to  be  added  to  any  node  label  already  containing  C  U  D. 
From  a  correctness  perspective,  this  choice  is  relevant  (be¬ 
cause  one  choice  may  lead  to  the  successful  construction  of 
a  model,  while  another  one  does  not),  and  is  usually  dealt 
with  by  backtracking  search.  Although  such  search  must  (in 
the  worst  case)  consider  all  possible  expansions,  the  order  in 
which  they  are  considered  can  still  have  a  big  effect  on  effi¬ 
ciency. 

Two  kinds  of  rule  will  be  of  particular  interest  in  the  fol¬ 
lowing  discussion:  non-deterministic  rules,  such  as  the  U-rule 
mentioned  above,  and  generating  rules,  such  as  the  3-rule, 
that  add  new  nodes  to  the  graph.  Applying  these  rules  is 
likely  to  be  more  “costly”,  as  they  either  increase  the  size 
of  the  graph  or  increase  the  size  of  the  search  space,  and  they 
are  typically  applied  with  lower  priority  than  other  rules. 

3  FaCT++  System  Architecture 

As  discussed  above,  many  implementations  use  a  top-down 
expansion  based  on  the  trace  technique.  The  idea  of  the  top- 
down  expansion  is  to  apply  the  3-rule  with  the  lowest  priority 
(i.e.,  only  apply  this  rule  when  no  other  rule  is  applicable);  the 
added  refinement  of  the  trace  technique  is  to  discard  fully  ex¬ 
panded  sub-trees,  so  that  only  a  single  “trace”  (i.e.,  a  branch 
of  the  tree)  is  kept  in  memory  at  any  one  time. 

This  technique  has  the  advantage  of  being  very  simple  and 
easy  to  implement — a  procedure  that  exhaustively  expands  a 
node  label  can  be  applied  to  the  current  node  and  then,  recur¬ 
sively,  to  each  of  its  successors.  It  does,  however,  have  some 
serious  drawbacks.  In  the  first  place,  for  logics  with  inverse 
roles,  the  top-down  method  simply  breaks  down  as  it  relies 
on  the  fact  that  rules  only  ever  add  concepts  to  the  label  of 
the  node  to  which  they  are  applied  or  to  the  label  of  one  of  its 
successor  nodes.  The  result  is  that,  once  the  rules  have  been 
exhaustively  applied  to  a  given  node  label,  no  further  expan¬ 
sion  of  that  label  will  be  possible.  In  the  presence  of  inverse 
roles,  expansion  rules  may  also  add  concepts  to  the  labels  of 
predecessor  nodes,  which  could  then  require  further  expan¬ 
sion.  Moreover,  discarding  fully  expanded  sub-trees  may  no 
longer  be  possible,  as  the  expansion  of  a  concept  added  to 
the  label  of  a  predecessor  may  cause  concepts  to  be  added 
to  the  label  of  a  sibling  node  that  had  previously  been  fully 
expanded. 

In  the  second  place,  the  top  down  method  forces  non- 
deterministic  rules  to  be  applied  with  a  higher  priority  than 
generating  rules.  As  the  size  of  the  search  space  caused 
by  non-deterministic  rule  expansions  is,  in  practice,  by 

2  Although  the  correctness  of  some  algorithms  requires  a  priority 
ordering  for  different  rules. 


far  the  most  serious  problem  for  tableaux  based  systems 
[Horrocks,  1997],  it  may  be  advantageous  to  apply  non- 
deterministic  rules  with  the  lowest  priority  [Giunchiglia  & 
Sebastiani,  1996].  In  fact,  top-down  implementations  typ¬ 
ically  apply  non-deterministic  rules  with  a  priority  that  is 
lower  than  that  of  all  of  the  other  rules  except  the  generat¬ 
ing  rules  [Horrocks  &  Patel-Schneider,  1999]. 

ToDo  List  Architecture  The  FaCT++  system  was  designed 
with  the  intention  of  implementing  DLs  that  include  inverse 
roles,  and  of  investigating  new  optimisation  techniques,  in¬ 
cluding  new  ordering  heuristics.  Currently,  FaCT++  imple¬ 
ments  SUIT,  a  slightly  less  expressive  variant  of  STLXQ 
where  the  values  in  atleast  and  atmost  restrictions  can  only 
be  zero  or  one.3 

Instead  of  the  top-down  approach,  FaCT++  uses  a  ToDo 
list  to  control  the  application  of  the  expansion  rules.  The  ba¬ 
sic  idea  behind  this  approach  is  that  rules  may  become  appli¬ 
cable  whenever  a  concept  is  added  to  a  node  label.  When  this 
happens,  a  note  of  the  node/concept  pair  is  added  to  the  ToDo 
list.  The  ToDo  list  sorts  all  entries  according  to  some  order, 
and  gives  access  to  the  “first”  element  in  the  list. 

A  given  tableaux  algorithm  takes  an  entry  from  the  ToDo 
list  and  processes  it  according  to  the  expansion  rule(s)  rele¬ 
vant  to  the  entry  (if  any).  During  the  expansion  process,  new 
concepts  may  be  added  to  node  labels,  and  hence  entries  may 
be  added  to  the  ToDo  list.  The  process  continues  until  either 
a  clash  occurs  or  the  ToDo  list  become  empty. 

In  FaCT++  we  implement  the  ToDo  list  architecture  as  a 
set  of  queues  (FIFO  buffers).  It  is  possible  to  set  a  priority 
for  each  rule  type  (e.g.,  fl  and  3),  and  a  separate  queue  is  im¬ 
plemented  for  each  unique  priority.  Whenever  the  expansion 
algorithm  asks  for  a  new  entry,  it  is  taken  from  the  non-empty 
queue  with  the  highest  priority,  and  the  algorithm  terminates 
when  all  the  queues  are  empty.  This  means  that  if  the  3-rule 
has  a  low  priority  (say  0),  and  all  other  rules  have  the  same 
priority  (say  1),  then  the  expansion  will  be  (modulo  inverse 
roles)  top-down  and  breadth  first;  if  stacks  (LIFO  buffers) 
were  used  instead  of  queues  with  the  same  priorities,  then 
the  expansion  would  simulate  the  standard  top-down  method. 

4  Heuristics 

When  implementing  reasoning  algorithms,  heuristics  can  be 
used  to  try  to  find  a  “good”  order  in  which  to  apply  infer¬ 
ence  rules  (we  will  call  these  rule-ordering  heuristics)  and, 
for  non-deterministic  rules,  the  order  in  which  to  explore  the 
different  expansion  choices  offered  by  rule  applications  (we 
will  call  these  expansion-ordering  heuristics).  The  aim  is  to 
choose  an  order  that  leads  rapidly  to  the  discovery  of  a  model 
(in  case  the  input  is  satisfiable)  or  to  a  proof  that  no  model 
exists  (in  case  the  input  is  unsatisfiable).  The  usual  technique 
is  to  compute  a  weighting  for  each  available  option,  and  to 
choose  the  option  with  the  highest  (or  lowest)  weight.  Much 
of  the  “art"  in  devising  useful  heuristics  is  in  finding  a  suit¬ 
able  compromise  between  the  cost  of  computing  the  weight¬ 
ings  and  their  accuracy  in  predicting  good  orderings. 

Such  heuristics  can  be  very  effective  in  improving  the 
performance  of  propositional  satisfiability  (SAT)  reason- 
ers  [Freeman,  1995],  but  finding  useful  heuristics  for  descrip¬ 
tion  and  modal  logics  has  proved  to  be  more  difficult.  Choos¬ 
ing  a  good  heuristic,  or  at  least  not  choosing  a  bad  one,  is  very 
important:  an  inappropriate  heuristic  may  not  simply  fail  to 
improve  performance,  it  may  seriously  degrade  it.  Even  more 

3 SUIT  corresponds  to  the  OWL-Lite  ontology  language  [Hor¬ 
rocks  et  at.,  2003]. 


problematical  is,  given  a  range  of  possible  heuristics,  choos¬ 
ing  the  best  one  to  use  for  a  given  (type  of)  problem. 

So  far,  the  heuristics  tried  with  DL  reasoners  have  mainly 
been  adaptions  of  those  already  developed  for  SAT  reason¬ 
ers,  such  as  the  well  known  MOMS  heuristic  [Freeman, 
1995]  and  Jeroslow  and  Wang’s  weighted  occurrences  heuris¬ 
tic  [Jeroslow  &  Wang,  1990],  These  proved  to  be  largely  in¬ 
effective,  and  even  to  degrade  performance  due  to  an  adverse 
interaction  with  backjumping  [Baader  et  ah,  2003].  An  al¬ 
ternative  heuristic,  first  presented  in  [Horrocks,  1997],  tries 
to  maximise  the  effect  of  dependency  directed  backtracking 
(backjumping)  by  preferentially  choosing  expansions  that  in¬ 
troduce  concept  with  “old”  dependencies.  Even  this  heuristic, 
however,  has  relatively  little  effect  on  performance  with  real¬ 
istic  problems,  e.g.,  problems  encountered  when  reasoning 
with  application  ontologies. 

We  conjecture  that  the  standard  top-down  architecture  has 
contributed  to  the  difficulty  in  finding  useful  heuristics  as  it 
rules  out  many  possible  choices  of  rule-ordering;  in  particu¬ 
lar,  the  top-down  technique  may  require  generating  rules  to  be 
applied  with  a  low  priority,  and  so  lead  to  non-deterministic 
rules  being  applied  before  deterministic  generating  rules.  In 
contrast,  the  ToDo  list  architecture  gives  a  much  wider  range 
of  possible  rule  orderings,  and  so  has  allowed  us  to  investigate 
a  range  of  new  rule-ordering  heuristics,  in  particular  heuris¬ 
tics  that  give  non-deterministic  rules  the  lowest  priority. 

Another  factor  that  has  contributed  to  the  weakness  of  SAT 
derived  heuristics  is  that  they  treat  concepts  as  though  they 
were  atoms.  This  is  obviously  appropriate  in  the  case  of 
propositional  satisfiability,  but  not  in  the  case  of  concept  sat¬ 
isfiability  where  sub-concepts  may  have  a  complex  structure. 
We  have  also  investigated  expansion-ordering  heuristics  that 
take  into  account  this  structure,  in  particular  a  concept’s  size, 
maximum  quantifier  depth,  and  frequency  of  usage  in  the 
knowledge  base. 

Implementation  in  FaCT++  The  FaCT++  reasoner  uses  the 
standard  backtracking  search  technique  to  explore  the  differ¬ 
ent  possible  expansions  offered  by  non-deterministic  rules 
(such  as  the  U-rule).  Before  applying  a  non-deterministic 
rule,  the  current  state  is  saved,  and  when  backtracking,  the 
state  is  restored  before  re-applying  the  same  rule  (with  a  dif¬ 
ferent  expansion  choice).  When  inverse  roles  are  supported, 
it  is  possible  for  a  sequence  of  deterministic  rule  applications 
to  propagate  changes  throughout  the  graph,  and  it  may,  there¬ 
fore,  be  necessary  to  save  and  restore  the  whole  graph  struc¬ 
ture  (in  addition  to  other  data  structures  such  as  the  ToDo 
list).  FaCT++  trys  to  minimise  the  potentially  high  cost  of 
these  operations  by  lazily  saving  the  graph,  (i.e.,  saving  parts 
of  the  graph  only  as  necessitated  by  the  expansion),  but  the 
cost  of  saving  the  state  still  makes  it  expensive  to  apply  a 
non-deterministic  rule,  even  if  the  state  is  never  restored  dur¬ 
ing  backtracking. 

As  discussed  in  Section  3,  FaCT++  uses  a  ToDo  list  ar¬ 
chitecture  with  separate  queues  for  each  priority  level.  Dif¬ 
ferent  rule-ordering  heuristics  can,  therefore,  be  tried  sim¬ 
ply  by  varying  the  priorities  assigned  to  different  rule  types. 
Low  priorities  are  typically  given  to  generating  and  non- 
deterministic  rules,  but  the  ToDo  list  architecture  allows  dif¬ 
ferent  priority  ordering  of  these  rule  types;  in  contrast,  the 
top-down  architecture  forces  a  lower  priority  to  be  given  to 
generating  rules. 

FaCT++  also  includes  a  range  of  different  expansion¬ 
ordering  heuristics  that  can  be  used  to  choose  the  order  in 
which  to  explore  the  different  expansion  choices  offered  by 
the  non-deterministic  U-rule.  This  ordering  can  be  on  the  ba- 


sis  of  the  size,  maximum  quantifier  depth,  or  frequency  of  us¬ 
age  of  each  of  the  concepts  in  the  disjunction,  and  the  order 
can  be  either  ascending  (smallest  size,  minimum  depth  and 
lowest  frequency  first)  or  descending.  In  order  to  avoid  the 
cost  of  repeatedly  computing  such  values,  FaCT++  gathers 
all  the  relevant  statistics  for  each  concept  as  the  knowledge 
base  is  loaded,  and  caches  them  for  later  use. 

5  Empirical  Analysis 

In  order  to  evaluate  the  usefulness  of  the  heuristics  imple¬ 
mented  in  FaCT++,  we  have  carried  out  an  empirical  analy¬ 
sis  using  both  real-life  ontologies  and  artificial  tests  from  the 
DL'98  test  suite  [Horrocks  &  Patel-Schneider,  1998]. 

Ontologies  can  vary  widely  in  terms  of  size  and  complexity 
(e.g.,  structure  of  concepts,  and  types  of  axiom  used).  We 
used  three  ontologies  with  different  characteristics  in  order  to 
see  how  the  heuristics  would  perform  in  each  case: 
WineFood  A  sample  ontology  that  makes  up  part  of  the 
OWL  test  suit4  [Carroll  &De  Roo,  2004];  it  is  small,  but 
has  a  complex  structure  and  includes  150  GCIs. 

DOLCE  A  foundational  (top-level)  ontology,  developed  in 
the  WonderWeb  project  [Gangemi  et  al.,  2002];  it  is  of 
medium  size  and  medium  complexity. 

GALEN  The  anatomical  part  of  the  well-known  medical 
terminology  ontology  [Rogers  et  al.,  2001];  it  is  large 
(4,000  concepts)  and  has  a  relatively  simple  structure, 
but  includes  over  400  GCIs. 

FaCT++  separates  the  classification  process  into  satisfia¬ 
bility  testing  (SAT)  and  subsumption  testing  (SUB)  phases; 
the  results  from  the  SAT  phase  are  cached  and  used  to  speed 
up  subsequent  tests  via  a  standard  “model-merging"  optimi¬ 
sation  [Horrocks  &  Patel-Schneider,  1999],  FaCT++  allows 
different  heuristics  to  be  used  in  the  two  phases  of  the  pro¬ 
cess;  this  is  because  the  tests  have  different  characteristics:  in 
the  SAT  phase,  nearly  all  of  the  tests  are  satisfiable  (ontolo¬ 
gies  typically  do  not  give  names  to  unsatisfiable  concepts), 
while  in  the  SUB  phase,  up  to  one  in  four  of  the  tests  are  un¬ 
satisfiable.  We  measured  the  time  (in  CPU  seconds)  taken  by 
FaCT++  to  complete  each  phase. 

In  addition  to  the  ontologies,  we  used  artificially  generated 
test  data  from  the  DL'98  test  suite.  Artificial  tests  are  in  some 
sense  corner  cases  for  a  DL  reasoner  designed  primarily  for 
ontology  reasoning,  and  these  tests  are  mainly  intended  to  in¬ 
vestigate  the  effect  of  hard  problems  with  very  artificial  struc¬ 
tures  on  the  behaviour  of  our  heuristics.  For  this  purpose  we 
selected  from  the  test  suite  several  of  the  tests  that  proved  to 
be  hard  for  FaCT++. 

Each  of  these  tests  consists  of  a  set  of  21  satisfiability  test¬ 
ing  problems  of  similar  structure,  but  (supposedly  exponen¬ 
tially)  increasing  difficulty;  the  idea  of  the  test  is  to  determine 
the  number  of  the  largest  problem  that  can  be  solved  within  a 
fixed  amount  of  processing  time  (100  seconds  of  CPU  time  in 
our  case).  The  names  of  the  tests  are  of  the  form  “test_p” 
or  “test_n”,  where  “test”  refers  to  the  kind  of  problem 
(e.g.,  the  “ph”  tests  are  derived  from  encodings  of  pigeon 
hole  sorting  problems),  and  “p/n”  refers  to  whether  the  prob¬ 
lems  in  the  test  set  are  satisfiable  (n)  or  unsatisfiable  (p).  For 
these  tests  we  have  reported  the  number  of  the  largest  prob¬ 
lem  solved  in  less  than  100  seconds  (21  means  that  all  the 
problems  were  solved),  along  with  the  time  (in  CPU  seconds) 
taken  for  the  hardest  problem  that  was  successfully  solved. 

4This  ontology  therefore  has  a  much  weaker  claim  to  being  “real- 
life”. 


For  all  the  tests,  FaCT++  v.0.99.2  was  used  on  Pentium 
4  2.2  GHz  machine  with  512Mb  of  memory,  running  Linux. 
Times  were  averaged  over  3  test  runs. 

5.1  Rule-ordering  Heuristics 

In  these  tests  we  tried  a  range  of  different  rule-ordering  strate¬ 
gies.  Each  “strategy”  is  shown  as  a  sequence  of  letters  spec¬ 
ifying  the  priorities  (highest  first)  of  the  different  rule  types, 
where  “O”  refers  to  the  U-rule,  “E"  to  the  3-rule,  and  “a”  to 
any  other  rule  type.  E.g.,  “aO”  describes  the  strategy  where 
the  U-rule  has  the  lowest  priority,  and  all  other  rules  have  an 
equal  higher  priority. 

Ontology  tests  The  results  of  using  different  rule-ordering 
strategies  with  the  various  ontologies  are  shown  in  Table  1 . 
All  ontologies  were  tested  with  the  best  disjunction-ordering 
heuristic,  as  determined  in  separate  tests  (see  below). 
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Table  1 :  Ontology  tests  with  different  rule-orderings 
The  first  thing  to  note  is  that  rule-orderings  have  relatively 
little  effect  on  the  DOLCE  and  WineFood  ontologies;  in  con¬ 
trast,  the  performance  of  the  best  and  worst  strategies  dif¬ 
fers  by  a  factor  of  almost  10  in  the  GALEN  tests.  Even  in 
the  GALEN  case,  however,  the  difference  between  the  “-O” 
strategies  (i.e.,  those  that  assign  the  lowest  priority  to  the  U- 
rule)  and  “-E"  strategies  (i.e.,  those  that  assign  the  lowest  pri¬ 
ority  to  the  3-rule)  is  relatively  small.  In  most  cases  the  best 
result  is  given  by  the  "aEO”  strategy,  i.e.,  by  assigning  the 
lowest  priority  to  the  U-rule  and  the  next  lowest  priority  to  the 
3-rule,  and  even  when  “aEO”  is  not  the  best  strategy,  the  dif¬ 
ference  between  it  and  the  best  strategy  is  very  small.  More¬ 
over,  the  difference  between  the  “aEO”  and  “aOE”  strategies 
is  small  in  most  cases,  and  never  more  than  a  factor  of  2. 
DL98  tests  The  results  of  using  different  rule-ordering  strate¬ 
gies  with  the  DL98  tests  are  shown  in  Table  2.  The  first  thing 
to  note  from  these  results  is  that  rule-ordering  heuristics  can 
have  a  much  more  significant  effect  than  in  the  ontology  tests: 
in  some  cases  the  performance  of  the  best  and  worst  strategies 
differs  by  a  factor  of  more  than  100.  In  most  tests,  the  “-E” 
strategies  give  the  best  results,  with  the  difference  between 
“-O”  and  “-E”  strategies  being  much  more  marked  than  in 
the  case  of  the  ontology  tests.  In  the  case  of  the  d4_n  test, 
however,  performance  is  dramatically  improved  (by  a  factor 
of  20)  when  an  “-0”  strategy  is  used. 
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Table  2:  DL-98  tests  with  different  rule-ordering  strategies 
5.2  Expansion-ordering  Heuristics 
In  these  tests  we  tried  a  range  of  different  expansion-ordering 
heuristics.  Each  heuristic  is  denoted  by  two  letters,  the  first  of 


which  indicates  whether  the  ordering  is  based  on  concept  size 
(“S”),  maximum  depth  (“D”)  or  frequency  of  usage  (“F”),  and 
the  second  of  which  indicates  ascending  (“a”)  or  descending 
(“d”)  order.  In  each  group  of  tests  we  used  the  best  rule¬ 
ordering  heuristic  as  determined  by  the  tests  in  Section  5.1. 
Ontology  tests  For  the  ontology  tests,  we  tried  different  or¬ 
derings  for  the  SAT  and  SUB  phases  of  classification.  The 
results  are  presented  in  Tables  3,  4  and  5;  the  first  figure  in 
each  column  is  the  time  taken  by  the  SAT  phase  using  the 
given  ordering,  and  the  remaining  figures  are  the  subsequent 
times  taken  using  different  SUB  phase  orderings. 

For  DOLCE  (Table  3),  the  difference  between  the  best  and 
worst  orderings  was  a  factor  of  about  4,  and  many  possible  or¬ 
derings  were  near  optimal.  For  WineFood  (Table  4),  the  dif¬ 
ference  between  the  best  and  worst  orderings  was  a  factor  of 
about  2,  and  using  Sd  for  SAT  tests  and  Dd  for  SUB  tests  gave 
the  best  result,  although  several  other  orderings  gave  similar 
results.  For  GALEN  (Table  5),  the  difference  between  the 
best  and  worst  orderings  was  so  large  that  we  were  only  the 
orderings  given  allowed  tests  to  be  completed  in  a  reasonable 
time.  The  best  result  was  given  by  using  Da  for  both  phases. 
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Table  3:  DOLCE  test  with  different  expansion-orderings 
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1.83 

Dd 

1.69 

1.77 

1.87 

1.66 

1.78 

1.78 

Fd 

2.30 

2.26 

2.75 

3.14 

3.54 

2.76 

Table  4:  WineFood  test  with  different  expansion-orderings 


SAT 

Sa 

Dir 

SUB 

18775” 

9^8““ 

Sa 

Da 

Fd 

276.90 

185.79 

1049.74 

276.16 

172.89 

943.06 

Table  5:  GALEN  test  with  different  expansion-orderings 

DL98  tests  Table  6  presents  the  results  for  the  DL98  tests. 
Each  column  shows  the  times  taken  using  different  expan¬ 
sion  orderings  to  solve  the  hardest  problem  that  was  solvable 
within  the  stipulated  time  limit  using  any  ordering. 

In  almost  every  test,  the  difference  between  the  best  and 
worst  strategies  is  large:  a  factor  of  more  than  300  in  the  d4_n 
test.  Moreover,  strategies  that  are  good  in  one  test  can  be  very 
bad  in  another  (the  Sd  and  Dd  strategies  are  the  best  ones  in 
the  branch  tests  (br_n  and  br_p),  but  (by  far)  the  worst  in  the 
d4_n  test),  and  this  is  not  strongly  dependent  on  the  satisfi¬ 
ability  result  (in  the  br  tests,  all  strategies  perform  similarly 
in  both  satisfiable  and  unsatisfiable  cases).  The  Fd  strategy  is, 
however,  either  optimal  or  near  optimal  in  all  cases. 


order 

br_n 

br_p 

d4_n 

ph_n 

ph-P 

test  1 I 

test  12 

test  21 

test  10 

test  7 

Sa 

22.6 

24.8 

0.9 

8.1 

29.5 

Da 

22.6 

24.8 

0.9 

>300 

24.5 

Fa 

>300 

>300 

32.0 

22.9 

20.2 

Sd 

17.0 

18.3 

>300 

38.7 

24.7 

Dd 

17.1 

18.3 

>300 

19.7 

19.3 

Fd 

22.2 

25.1 

0.8 

6.2 

15.3 

Table  6:  DL98  tests  with  different  Or  strategies 


5.3  Analysis 

The  different  rule-ordering  heuristics  we  tried  had  relatively 
little  effect  on  the  performance  of  the  reasoner  when  classify¬ 
ing  the  DOLCE  and  WineFood  ontologies.  With  the  GALEN 
ontology,  any  strategy  that  gave  a  lower  priority  to  the  3-  and 
U-rules  worked  reasonably  well,  and  the  aEO  strategy  was 
optimal  or  near-optimal  in  all  cases.  The  crucial  factor  with 
GALEN  is  giving  low  priority  to  the  3-rule.  This  is  due  to 
the  fact  that  GALEN  is  large,  contains  many  GCIs  and  also 
contains  existential  cycles  in  concept  inclusion  axioms  (e.g., 
C  C  3 R.D  and  D  C  3 R~ . C );  as  a  result,  the  graph  can  grow 
very  large,  and  this  increases  both  the  size  of  the  search  space 
(because  GCI  related  non-determinism  may  apply  on  a  per- 
node  basis)  and  the  cost  of  saving  and  restoring  the  state  dur¬ 
ing  backtracking  search.  Giving  a  low  priority  to  the  3-rule 
minimises  the  size  of  the  graph  and  hence  can  reduce  both  the 
size  of  the  search  space  and  the  cost  of  saving  and  restoring. 
This  effect  is  less  noticeable  with  the  other  ontologies  be¬ 
cause  their  smaller  size  and/or  lower  number  of  GCIs  greatly 
reduces  the  maximum  size  of  graphs  and/or  search  space.  In 
view  of  these  results,  FaCT++'s  default  rule-ordering  strategy 
has  been  set  to  aEO.5 

The  picture  is  quite  different  in  the  case  of  the  DL'98  tests. 
Here,  different  strategies  can  make  a  large  difference,  and  no 
one  strategy  is  universally  near  optimal.  This  is  to  be  ex¬ 
pected,  given  that  some  of  the  tests  include  very  little  non¬ 
determinism,  but  are  designed  to  force  the  construction  of 
very  large  models  (and  hence  graphs),  while  others  are  highly 
non-deterministic,  but  have  only  very  small  models.  Given 
that  these  extreme  cases  are  not  representative  of  typical  real- 
life  ontologies,  the  test  results  may  not  be  directly  relevant  to 
a  system  designed  to  deal  with  such  ontologies.  It  is  inter¬ 
esting,  however,  to  see  how  badly  the  heuristics  can  behave 
in  such  cases:  in  fact  the  standard  aEO  strategy  is  near  opti¬ 
mal  in  two  of  the  tests,  and  is  never  worse  than  the  optimal 
strategy  by  a  factor  of  more  than  2. 

The  expansion-ordering  heuristics  had  a  much  bigger 
effect  on  ontology  reasoning  performance  (than  the  rule¬ 
ordering  heuristics).  In  the  case  of  DOLCE  and  WineFood, 
almost  any  strategy  that  uses  Sd  or  Dd  in  the  SUB  phase  is 
near  optimal.  For  GALEN,  however,  using  Da  in  both  phases 
gives  by  far  the  best  results.  This  is  again  due  to  the  character¬ 
istic  structure  of  this  ontology,  and  the  fact  that  preferentially 
choosing  concepts  with  low  modal  depth  tends  to  reduce  the 
size  of  the  graph.  Unfortunately,  no  one  strategy  is  univer¬ 
sally  good  (Da/Da  is  best  for  GALEN  but  worst  for  DOLCE 
and  WineFood);  currently,  Sd/Dd  is  the  default  setting,  as  the 
majority  of  real  life  ontologies  resemble  DOLCE  and  Wine¬ 
Food  more  than  GALEN),  but  this  can  of  course  be  changed 

5Top-down  architectures  necessarily  give  lowest  priority  to  the 
3-rule,  and  generally  give  low  priority  to  U-rule,  which  is  why  they 
work  relatively  well  with  ontologies. 


by  the  user  if  it  is  known  that  the  ontology  to  be  reasoned  with 
will  have  a  GALEN-like  structure. 

For  the  DL'98  tests,  the  picture  is  again  quite  confused: 
the  Sd  strategy  (the  default  in  the  SAT  phase)  is  optimal  in 
some  tests,  but  bad  in  others — disastrously  so  in  the  case  of 
the  d4_n  test.  As  in  the  ontology  case,  the  only  “solution" 
offered  at  present  is  to  allow  users  to  tune  these  settings  ac¬ 
cording  to  the  problem  type  or  empirical  results. 

6  Discussion  and  Future  Work 

We  have  described  the  ToDo  list  architecture  used  in  the 
FaCT++  system  along  with  a  range  of  heuristics  that  can  be 
used  for  rule  and  expansion  ordering.  We  have  also  presented 
an  empirical  analysis  of  these  heuristics  and  shown  how  these 
have  led  us  to  select  the  default  setting  currently  used  by 
FaCT++. 

These  default  settings  reflect  the  current  predominance  of 
relatively  small  and  simply  structured  ontologies.  This  may 
not,  however,  be  a  realistic  picture  of  the  kinds  of  ontology 
that  we  can  expect  in  the  future:  many  existing  ontologies  (in¬ 
cluding,  e.g.,  WineFood)  pre-date  the  development  of  OWE, 
and  have  been  translated  from  less  expressive  formalisms. 
With  more  widespread  use  of  OWE,  and  the  increasing  avail¬ 
ability  of  sophisticated  ontology  development  tools,  it  may  be 
reasonable  to  expect  the  emergence  of  larger  and  more  com¬ 
plex  ontologies.  As  we  have  seen  in  Section  5.1,  heuristics 
can  be  very  effective  in  helping  us  to  deal  efficiently  with 
such  ontologies,  but  choosing  a  suitable  heuristic  becomes  of 
critical  importance. 

In  our  existing  implementation,  changing  heuristics  re¬ 
quires  the  user  to  set  the  appropriate  parameters  when  using 
the  reasoner.  This  is  clearly  undesirable  at  best,  and  unre¬ 
alistic  for  non-expert  users.  We  are,  therefore,  working  on 
techniques  that  will  allow  us  to  guess  the  most  appropriate 
heuristics  for  a  given  ontology.  The  idea  is  to  make  an  initial 
guess  based  on  an  analysis  of  the  syntactic  structure  of  the  on¬ 
tology  (it  should  be  quite  easy  to  distinguish  GALEN-like  on¬ 
tologies  from  DOLCE  and  WineFood-like  ontologies  simply 
by  examining  the  statistics  that  have  already  been  gathered 
for  use  in  expansion-ordering  heuristics),  with  subsequent  ad¬ 
justments  being  made  based  on  the  behaviour  of  the  algorithm 
(e.g.,  the  size  of  graphs  being  constructed). 

Another  limitation  of  the  existing  implementation  is  that  a 
single  strategy  is  used  for  all  the  tests  performed  in  the  clas¬ 
sification  process.  In  practice,  the  characteristics  of  differ¬ 
ent  tests  (e.g.,  w.r.t.  concept  size  and/or  satisfiability)  may 
vary  considerable,  and  it  may  make  sense  to  dynamically 
switch  heuristics  depending  on  the  kind  of  test  being  per¬ 
formed.  This  again  depends  on  having  an  effective  (and 
cheap)  method  for  analysing  the  likely  characteristics  of  a 
given  test,  and  syntactic  and  behavioural  analyses  will  also 
be  investigated  in  this  context. 
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Abstract.  Tableaux  algorithms  are  currently  the  most  widely-used  and  empirically  the  fastest 
algorithms  for  reasoning  in  expressive  Description  Logics,  including  the  important  Descrip¬ 
tion  Logics  STLIQ  and  STLOIQ.  Achieving  a  high  level  of  performance  on  terminological 
reasoning  in  expressive  Description  Logics  when  using  tableaux-based  algorithms  requires 
the  incorporation  of  a  wide  variety  of  optimisations.  The  Description  Logic  system  FaCT++ 
implements  a  wide  variety  of  such  optimisations,  some  present  in  other  reasoners  and  some 
novel  or  refined  in  FaCT++.  Together  these  optimisations  make  FaCT++  one  of  the  fastest 
current  systems  for  terminological  reasoning  on  expressive  Description  Logics. 

Keywords:  Description  Logic,  Reasoning  Systems,  Optimisations 


1.  Introduction 

Description  Logics  (DLs)  are  a  family  of  logic  based  knowledge  representa¬ 
tion  formalisms.  Although  they  have  a  range  of  applications  (e.g.,  configura¬ 
tion  (McGuinness  and  Wright,  1998)  and  reasoning  with  database  schemas 
and  queries  (Calvanese  et  al.,  1998b;  Calvanese  et  al.,  1998a)),  they  are  per¬ 
haps  best  known  as  the  basis  for  widely  used  ontology  languages  such  as 
OIL  (Fensel  et  al.,  2001 ),  DAML+OIL  (Horrocks  et  al.,  2002)  and  OWL  (Hor¬ 
rocks  et  ah,  2003).  As  well  as  DLs  providing  the  formal  underpinnings  for 
these  ontology  languages  (by  means  of  the  declarative  semantics  of  DLs),  DL 
systems  are  also  used  to  provide  computational  services  for  ontology  tools 
and  applications  (Knublauch  et  ah,  2004;  Rector,  2003).  Such  services  typ¬ 
ically  include  (at  least)  computing  the  subsumption  hierarchy  and  checking 
the  consistency  of  named  concepts  in  an  ontology. 

Most  modern  DL  systems  are  based  on  tableaux  algorithms.  Such  al¬ 
gorithms  were  first  introduced  by  Schmidt-Schau B  and  Smolka  (Schmidt- 
SchauB  and  Smolka,  1991),  and  subsequently  extended  to  deal  with  a  wide 
range  of  different  logics  (Baader  et  ah,  2003).  Many  systems  now  implement 
the  STLIQ  DL,  a  tableaux  algorithm  for  which  was  first  presented  by  Hor¬ 
rocks,  Sattler,  and  Tobies  (Horrocks  et  ah,  1999),  or  its  extension  STLOIQ , 
whose  tableaux  algorithm  was  recently  described  by  Horrocks  and  Sattler 
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(Horrocks  and  Sattler,  2005).  These  two  DLs  are  very  expressive,  and  cor¬ 
respond  closely  to  the  OWL  ontology  language.  (Reasoning  in  OWL  Lite 
can  be  reduced  to  reasoning  in  STLIQ  and  reasoning  in  OWL  DL  can  be 
reduced  to  reasoning  in  STLOIQ  (Horrocks  and  Patel-Schneider,  2003).)  In 
spite  of  the  high  worst  case  complexity  of  the  satisfiability/subsumption  prob¬ 
lem  for  this  logic  (ExpTime-complete  for  STLIQ  and  NExpTime-complete 
for  STLOIQ ),  highly  optimised  implementations  of  the  tableaux  algorithms 
for  STLIQ  and  STLOIQ  have  been  shown  to  work  well  in  many  realistic 
(ontology)  applications  (Horrocks,  1998;  Pan,  2005;  Stevens  et  al„  2002; 
Wolstencroft  et  al.,  2005). 

Optimisation  is  crucial  to  the  viability  of  systems  that  employ  tableaux  - 
based  algorithms  for  DL  reasoning:  in  experiments  using  both  artificial  test 
data  and  application  ontologies,  (relatively)  unoptimised  systems  performed 
very  badly,  often  being  (at  least)  several  orders  of  magnitude  slower  than  op¬ 
timised  systems;  in  many  cases,  hours  of  processing  time  (in  some  cases  even 
hundreds  of  hours)  proved  insufficient  for  unoptimised  systems  to  solve  prob¬ 
lems  that  took  only  a  few  milliseconds  for  an  optimised  system  (Massacci, 
1999;  Horrocks  and  Patel-Schneider,  1998). 

The  optimisation  of  tableaux  based  systems  has  been  the  subject  of  exten¬ 
sive  study  over  the  course  of  the  last  fifteen  years  (Baader  et  al.,  1994;  Hor¬ 
rocks,  1998;  Haarslev  and  Moller,  2001a;  Haarslev  and  Moller,  2001b;  Hor¬ 
rocks,  2003;  Horrocks  and  Sattler,  2002;  Tsarkov  and  Horrocks,  2005b),  and 
modern  systems  typically  employ  a  wide  range  of  optimisations,  including 
(at  least)  those  described  by  Baader  et  al  (Baader  et  al.,  1994)  and  Horrocks 
and  Patel-Schneider  (Horrocks  and  Patel-Schneider,  1999). 

In  this  paper  we  describe  the  optimisation  techniques  employed  in  our 
FaCT++  reasoner  (Tsarkov  and  Horrocks,  2005b).  While  focusing  mainly 
on  novel  techniques  and  significant  refinements  and  extensions  of  previously 
known  techniques,  we  have  also  described  the  “standard”  techniques  as  well 
as  some  simple  techniques  that  are  widely  employed  but  often  not  reported 
in  the  literature.  In  this  way  we  hope  to  provide  both  a  report  on  the  novel 
techniques  developed  in  the  FaCT++  system  and  a  reasonably  comprehen¬ 
sive  survey  of  the  optimisation  techniques  employed  in  state  of  the  art  DL 
reasoning  systems. 

Many  of  the  techniques  we  will  describe  could  be  applied  to  a  wide  range 
of  DLs.  We  will,  however,  focus  on  STLOIQ ,  because 

—  the  expressive  power  of  STLOIQ  subsumes  that  of  most  of  the  DLs 
discussed  in  the  literature  or  implemented  in  DL  reasoners; 

—  STLOIQ  is  the  logic  implemented  in  our  FaCT++  system,  and  in  other 
state  of  the  art  DL  reasoners  such  as  Pellet  (Sirin  et  al.,  2005b);1  and 

1  In  fact  Pellet  implements  the  slightly  weaker  logic  SHOUT . 


Optimisations.tex;  27/04/2006;  9:02;  p.2 


Optimising  Expressive  DLs 


3 


—  as  the  basis  of  the  W3C  OWL  web  ontology  language,  STLOIQ  is  now 
very  widely  used  in  practice. 

DLs  usually  distinguish  between  the  terminological  part  of  a  knowledge 
base  (called  the  TBox),  which  describes  the  structure  of  the  domain  of  dis¬ 
course  in  terms  of  concepts  and  roles,  and  the  assertional  part  (called  the 
ABox),  which  describes  some  particular  situation  in  terms  of  instances  of 
concepts  and  roles.  FaCT++  is  primarily  designed  to  support  TBox  reason¬ 
ing,  as  this  kind  of  reasoning  is  widely  used,  e.g.,  in  ontology  design  and 
maintenance  tools  such  as  Protege  (Protege,  2003)  and  Swoop  (Kalyanpur 
et  al.,  2005).  However,  the  expressive  power  of  STLOIQ  is  such  that  it  blurs 
the  usual  distinction  between  TBox  and  ABox,  and  FaCT++  exploits  this 
to  support  ABox  reasoning.  This  simple  approach  would,  however,  clearly 
not  scale  to  very  large  ABoxes,  and  for  a  discussion  of  implementation  and 
optimisation  techniques  designed  to  address  the  problems  of  reasoning  with 
large  ABoxes,  the  reader  is  referred  to  work  on  optimising  ABox  reasoning  in 
the  Racer  system  (Haarslev  and  Moller,  2000;  Haarslev  and  Moller,  2001a). 


2.  Preliminaries 

We  present  here  a  brief  introduction  to  description  logic  syntax  and  seman¬ 
tics  (in  particular  the  syntax  and  semantics  of  STLOIQ ),  and  to  tableaux 
algorithms  for  description  logics  (in  particular  for  STLOIQ).  We  will  not 
attempt  to  give  a  comprehensive  description  of  the  algorithms,  or  present 
any  proofs;  rather  we  will  provide  just  such  details  of  the  basic,  unoptimised 
method  as  will  be  necessary  in  order  to  understand  the  subsequent  sections 
on  optimisation  techniques.  The  interested  reader  is  referred  to  other  work  on 
tableaux  algorithms  (such  as  (Baader  and  Sattler,  2001))  for  further  details  on 
tableaux  algorithms  in  general,  and  the  introduction  of  the  STLOIQ  tableaux 
algorithm  (Horrocks  and  Sattler,  2005)  for  further  details  on  the  STLOIQ 
algorithm. 

2.1.  STLOIQ  Syntax  and  semantics 

STLOIQ  is  a  very  expressive  DL  that,  in  addition  to  the  standard  Boolean 
connectives,  allows  for  transitive  roles,  a  hierarchy  of  sub-  and  super-roles, 
inverse  (sometimes  called  converse)  roles,  qualified  cardinality  constraints, 
and  nominals  (i.e.,  the  ability  to  refer  to  individuals  in  concept  expressions). 
This  last  feature  leads  to  a  blurring  of  the  usual  DL  distinction  between  TBox 
(a  set  of  axioms  concerning  classes  and  roles)  and  ABox  (a  set  of  axioms  con¬ 
cerning  individuals);  we  will  return  to  this  point  when  discussing  the  syntax 
and  semantics  of  STLOIQ  knowledge  bases. 
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DEFINITION  1 .  Let  R  be  a  set  of  role  names  with  both  transitive  and  nor¬ 
mal  role  names  R+  U  Rp  =  R  where  Rp  fl  R+  =  0.  The  set  of  STiOXQ- 
roles  [or  roles  for  short )  is  R  U  {R~  \  R  G  R}.  A  role  inclusion  axiom  is  of 
the  form  R  E  S,for  two  roles  R  and  S.  A  role  hierarchy  is  a  finite  set  of  role 
inclusion  axioms. 

An  interpretation  X  =  (A1 .  ■I)  consists  of  a  non-empty  set  A1,  the  domain 
ofX,  and  a  function  x  which  maps  every  role  to  a  subset  of  A1  x  A 1  such 
that,  for  Pa  R  and  R  G  R+, 

(x,y)  G  P1  iff  (y,x)  G  P~X , 
and  if(x,  y)  G  R1  and  (y,  z)  G  Rx,  then  (x,  z)  G  R1 . 

An  interpretation  X  satisfies  a  role  hierarchy  7 Z  if  R1  C  S1  for  each  R  E 
S  G  TZ:  such  an  interpretation  is  called  a  model  ofTZ. 

We  introduce  some  notation  to  make  the  following  considerations  easier. 

1 .  The  inverse  relation  on  roles  is  symmetric,  and  to  avoid  considering  roles 
such  as  R  ,  we  define  a  function  I  n v  which  returns  the  inverse  of  a  role : 
for  Re  R,  Inv(P)  :=  R~  and  lnv(i?“)  :=  R. 

2.  Since  set  inclusion  is  transitive  and  R1  C  S1  implies  lnv(  R)1  C  lnv(.S')T, 
for  a  role  hierarchy  TZ,  we  introduce  E  n  as  the  transitive-reflexive  clo¬ 
sure  of  E  on  1Z  U  |lnv(i?)  E  Inv(S')  |  R  E  S  G  TZ}.  We  use  R  =n  S  as 
an  abbreviation  for  R  E  nS  and  S  E  nR. 

3.  Obviously,  a  role  R  is  transitive  if  and  only  if  its  inverse  lnv(i?)  is  tran¬ 
sitive.  However,  in  cyclic  cases  such  as  R  =-r  S,  S  is  transitive  if  R 
or  lnv(  R)  is  a  transitive  role  name.  In  order  to  avoid  these  case  distinc¬ 
tions,  the  function  Trans  returns  true  iff  R  is  a  transitive  role — regardless 
whether  it  is  a  role  name,  the  inverse  of  a  role  name,  or  equivalent  to  a 
transitive  role  name  (or  its  inverse):  Trans(5',  'R)  true  if,  for  some  P 
with  P  =  S,  P  G  R+  or  Inv(P)  G  R+;  Trans^,  TZ)  false  otherwise. 

4.  A  role  R  is  called  simple  w.r.t.  TZ  iff  Trans(5',  R)  —  false  for  all  S  EkP. 

5.  In  the  following,  if  TZ  is  clear  from  the  context,  then  we  may  abuse  our 
notation  and  use  E  and  Trans(S’)  instead  of  |R  and  Trans(S’,  TZ). 

DEFINITION  2.  Let  Nc  be  a  set  of  concept  names  with  a  subset  Nj  C 
Nc  of  nominals.  The  set  of  STiOXQ-c oncepts  (or  concepts  for  short)  is  the 
smallest  set  such  that 

1.  every  concept  name  C  G  Nc  is  a  concept, 
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2.  ifC  and  D  are  concepts  and  R  is  a  role,  then  (C  n  D),  (C  U  D),  (->(7), 
(' MR.C ),  and  ( 3R.C )  are  also  concepts  ( the  last  two  are  called  universal 
and  existential  restrictions,  resp.),  and 

3.  if  C  is  a  concept,  R  is  a  simple  role2  and  to  G  IN,  then  '.nR.C )  and 
f^nR.C)  are  also  concepts  ( called  utmost  and  atleast  restrictions,  resp.). 

The  interpretation  function  ■  of  an  interpretation  X  =  (A  ,  ■x)  maps,  addi¬ 
tionally,  every  concept  to  a  subset  of  A1  such  that 

( C  n  D)x  =  C1  n  Dx,  ( C  U  Df  =  C1  U  D1, 

-i  Cx  —  Ax  \  Cx,  )} ox  —  1  for  all  o  G  iV/, 

(: 3R.C)1  =  {x  G  Ax  I  Rx{x,C )  ^  0}, 

(VR.Cf  =  (x  e  Ax  |  Rx(x,  -i(7)  =  0}, 

(tfnR.C)1  —  (x  G  Ax  |  §Rx(x,  C)  ^  to},  and 
l^nR.C)1  =  (x  G  Ax  |  $Rx(x,C)  f  to}, 

where  (1 M  is  the  cardinality  of  a  set  M  and  Rr(x,  C )  is  defined  as  {y  \ 
(x,  y)  G  R1  and  y  G  C1}. 

DEFINITION  3.  A  STTOIQ  knowledge  base  (KB)  is  a  pair  (T,  A),  where 

—  T  ( the  TBox)  is  a  set  of  general  concept  inclusion  ( GCI )  axioms  of  the 
form  C  C  D,  where  C  and  D  are  ( possibly  complex)  concepts,  and 

—  A  (the  ABox)  is  a  set  of  axioms  of  the  form  i  :  C  and  ( i,j )  :  R,  where 
C  is  a  (possibly  complex)  concept,  R  is  a  role,  and  {i,j}  C  Nj  are 
nominals. 

An  interpretation  X  satisfies  a  GCI  C  C  D  if  Cx  C  Dx,  it  satisfies  an 
axiom  i  :  C  if  ix  C  Cx,  and  it  satisfies  an  axiom  ( i,j )  :  R  if  for  some 
(x,  y)  G  Rx,  ix  =  {a:}  and  jx  =  {y}.  An  interpretation  X  satisfies  a  TBox  T 
(resp.  an  ABox  A)  if  it  satisfies  each  axiom  in  X  (resp.  A),  and  it  satisfies  a 
KB  K.  —  (X,  A)  if  it  satisfies  both  X  and  A;  such  an  interpretation  is  called 
a  model  of  X  ( resp.  A,  K, ). 

Note  that  the  use  of  nominals  instead  of  individuals  in  ABox  axioms  leads 
to  the  slightly  non-standard  semantics,  but  is  otherwise  insignificant. 

As  mentioned  above,  nominals  blur  the  distinction  between  TBox  and 
ABox.  ABox  axioms  can  be  transformed  into  TBox  axioms:  it  is  easy  to  see 
that  an  interpretation  X  satisfies  an  axiom  i  :  C  iff  it  satisfies  i  C  C,  and  it 
satisfies  an  axiom  (i,  j)  :  R  iff  it  satisfies  i  C  3-R-j-  It  is,  therefore,  possible 
(and  convenient)  to  think  of  a  STiOXQ  KB  as  consisting  only  of  a  TBox. 

2  Restricting  number  restrictions  to  simple  roles  is  required  in  order  to  yield  a  decidable 
logic  (Horrocks  et  al„  1999). 
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DEFINITION  4.  A  TBox  X  is  satisfiable  w.r.t.  a  role  hierarchy  R  if  there 
is  a  model  X  of  X  and  R.  A  concept  C  is  satisfiable  w.r.t.  a  role  hierarchy 
R  and  a  TBox  X  if  there  is  a  model  X  of  R  and  T  with  C1  ^  0.  Such  an 
interpretation  is  called  a  model  of  C  w.r.t.  R  and  T .  A  concept  D  subsumes 
a  concept  C  w.r.t.  R  and  X  (written  C  Xurr  D)  ifCx  C  D 1  holds  in  every 
model  X  of  R  and  X.  Two  concepts  C,D  are  equivalent  w.r.t.  R  and  X 
(written  C  =n.T  D)  if  they  are  mutually  subsuming  w.r.t.  R  and  X. 

Note  that,  as  usual,  subsumption  and  satisfiability  can  be  reduced  to  each 
other:  C  Etc,t  D  if  (C  n  -i D)  is  not  satisfiable  w.r.t.  R  and  X ;  and  C  is 
not  satisfiable  w.r.t.  R  and  X  if  C  fn.T  -L.  Moreover,  the  satisfiability  of  a 
concept  w.r.t.  a  role  hierarchy  and  TBox  can  be  reduced  to  the  satisfiability  of 
a  TBox  w.r.t.  a  role  hierarchy:  C  is  satisfiable  w.r.t.  R  and  X  if  X  U  {o  C  C} 
is  satisfiable  w.r.t.  R,  where  o  is  a  “fresh”  nominal,  i.e.,  one  that  does  not 
occur  in  X.  When  R  is  obvious  from  the  context  (or  assumed  to  be  empty) 
we  will  talk  about  TBox  satisfiability;  when  both  R  and  X  are  obvious  from 
the  context  (or  assumed  to  be  empty)  we  will  talk  about  concept  satisfiabiliy 
and  subsumption  (written  C  C  !)). 

2.2.  A  Tableaux  Algorithm  for  SHOXQ 

The  basic  idea  behind  the  tableaux  algorithm  for  SHOXQ  TBox  satisfiability 
is  to  take  as  input  a  TBox  X  and  role  hierarchy  R,  and  to  try  to  prove  the 
satisfiability  of  X  w.r.t.  R  by  demonstrating  the  existence  of  a  model  X  of 
X  w.r.t.  R.  This  is  done  by  syntactically  decomposing  X  so  as  to  derive 
constraints  on  the  structure  of  such  a  model.  For  example,  if  a  nominal  o 
occurs  in  T,  then  any  model  of  X  must,  by  definition,  contain  some  individual 
x  such  that  ox  {a'},  and  if  o  C  3 R.D  is  an  axiom  in  X,  then  the  model 
must  also  contain  an  individual  y  such  that  ( x ,  y)  €  R1  and  y  is  an  element 
of  Dt\  if  D  is  non-atomic,  then  continuing  with  the  decomposition  of  D 
would  lead  to  additional  constraints.  The  construction  fails  if  the  constraints 
include  a  clash  (an  obvious  contradiction),  e.g.,  if  some  individual  a  must  be 
an  element  of  both  C  and  ->  C  for  some  concept  C.  The  algorithm  is  designed 
so  that  it  is  guaranteed  to  terminate,  and  guaranteed  to  construct  a  model  if 
one  exists;  such  an  algorithm  is  clearly  a  decision  procedure  for  SHOXQ 
TBox  satisfiability. 

In  practice,  the  algorithm  works  on  a  labelled  graph,  called  a  completion 
graph,  that  has  a  close  correspondence  to  a  model;  this  is  because  SHOXQ 
models  may  be  non-finite  (although  obviously  finitely  representable),  and 
because  it  is  convenient  not  to  explicate  edges  that  may  be  implied  by  transi¬ 
tivity  (of  roles). 

For  ease  of  presentation  we  will,  as  usual,  assume  all  concepts  to  be  in 
negation  normal  form  (NNF).  A  concept  can  be  transformed  into  an  equiv¬ 
alent  one  in  NNF  by  pushing  negation  inwards,  making  use  of  de  Morgan’s 
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laws  and  the  duality  between  existential  and  universal  restrictions,  and  be¬ 
tween  atmost  and  atleast  number  restrictions  (Horrocks  et  al.,  2000).  For  a 
concept  C,  we  use  SC  to  denote  the  NNF  of  -> C,  and  we  use  cl(C')  to  denote 
the  set  of  all  subconcepts  of  C  (including  C).  We  will  also  abuse  our  notation 
by  saying  that  a  TBox  T  is  in  NNF  if  all  the  concepts  occurring  in  T  are  in 
NNF,  and  by  using  cl  (T)  to  denote  the  set  of  all  subconcepts  of  the  concepts 
occurring  in  T,  i.e.,  U(ccb)gt  cK^)  u  cl(£>). 

DEFINITION  5.  Let  1Z  be  a  role  hierarchy  and  T  a  STLOIQ  TBox  in 
NNF.  A  completion  graph  for  T  with  respect  to  7 Z  is  a  directed  graph  G  — 
( V ,  E,L,jL)  where  each  node  x  E  V  is  labelled  with  a  set  L(x)  C  cl(T)  and 
each  edge  (x,  y)  E  E  is  labelled  with  a  set  of  role  names  E((x,  y))  containing 
(possibly  inverse)  roles  occurring  in  T  or  TZ.  Additionally,  we  keep  track  of 
inequalities  between  nodes  of  the  graph  with  a  symmetric  binary  relation  Lf 
between  the  nodes  of  G. 

If  (x,  y)  E  E,  then  y  is  called  a  successor  of  x  and  x  is  called  a  prede¬ 
cessor  ofy.  Ancestor  is  the  transitive  closure  of  predecessor,  and  descendant 
is  the  transitive  closure  of  successor.  A  node  y  is  called  an  R-successor  of 
a  node  x  if,  for  some  R'  with  R!  E  R,  R'  E  L((x,y));  x  is  called  an  R- 
predecessor  ofy  if  y  is  an  R-successor  of  x.  A  node  y  is  called  a  neighbour 
(i?-neighbour)  of  a  node  x  ify  is  a  successor  (R-successor)  of  x  or  if  x  is  a 
successor  (R~  -successor)  of  y. 

For  a  role  S  and  a  node  x  in  G,  we  define  the  set  ofx ’s  S-neighbours  with 
C  in  their  label,  SG(x,  C),  as  follows: 

SG(x,  C )  :=  {y  |  y  is  an  S-neighbour  ofx  and  C  E  E(y)}. 

G  is  said  to  contain  a  clash  if 

1.  for  some  A  E  Nq  and  node  x  of  G,  {A,  ~<A}  C  L(x), 

2.  for  some  node  x  of  G,  nS.C )  E  L(x)  and  there  are  n+1  S-neighbours 

yo, . . .  ,yn  ofx  with  C  E  H(yi)  for  each  0  <  i  <  n  and  yi  Lf  yj  for  each 
0  <  i  <  j  <  n,  or 

3.  for  some  o  E  Nj,  there  are  two  nodes  x,  y  of  G  with  x  f  y  and  o  E 
C(x)  0  L(y). 

If  or , . . . ,  0(  are  all  the  nominals  occurring  in  T,  then  the  tableau  algo¬ 
rithm  starts  with  the  completion  graph  G  =  ({ro,  r i . . . ,  rf},  0,  £,  0)  where 
£(ro)  «  {D}  and  fi (rf)  —  {oi}  for  1  <  i  <  l.  G  is  then  expanded  by 
repeatedly  applying  the  expansion  rules  given  in  Figures  1  and  2,  stopping  if 
a  clash  occurs. 

The  expansion  rules  use  some  auxiliary  terms  and  operations,  defined 
here: 
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Nominal  Nodes  and  Blockable  Nodes  We  distinguish  two  types  of  nodes  in 
G,  nominal  nodes  and  blockable  nodes.  A  node  x  is  a  nominal  node  if  L(x) 
contains  a  nominal.  A  node  that  is  not  a  nominal  node  is  a  blockable  node.  A 
nominal  o  €  N/  is  said  to  be  new  in  G  if  no  node  in  G  has  o  in  its  label. 

Blocking  A  node  x  is  label  blocked  if  it  has  ancestors  x',  y  and  y'  such 
that 

1 .  a:  is  a  successor  of  x'  and  y  is  a  successor  of  y', 

2.  y,  x  and  all  nodes  on  the  path  from  y  to  x  are  blockable, 

3.  E(x)  —  H(y)  and  L(x')  —  L(y'),  and 

4.  E((x',x})  =  L((y',y)). 

In  this  case,  we  say  that  y  blocks  x.  A  node  is  blocked  if  either  it  is  label 
blocked  or  it  is  blockable  and  its  predecessor  is  blocked;  if  the  predecessor  of 
a  blockable  node  x  is  blocked,  then  we  say  that  x  is  indirectly  blocked. 

Generating  and  Shrinking  Rules  and  Safe  Neighbours  The  rules  J>-mlc, 
3-rule,  and  o?-rule  are  called  generating  rules,  and  the  rules  ^-rule  and 
O-rule  are  called  shrinking  rules.  An  /(-neighbour  y  of  a  node  x  is  safe  if  (i) 
x  is  blockable  or  if  (ii)  x  is  a  nominal  node  and  y  is  not  blocked. 

Pruning  When  a  node  y  is  merged  into  a  node  x,  we  “prune”  the  completion 
graph  by  removing  y  and,  recursively,  all  blockable  successors  of  y.  More 
precisely,  pruning  a  node  y  (written  Prune(t/))  in  G  =  ( V,  E,  L,  f  )  yields  a 
graph  that  is  obtained  from  G  as  follows: 

1.  for  all  successors  z  of  y,  remove  (y.  z)  from  E  and,  if  z  is  blockable, 
Prune(z); 

2.  remove  y  from  V. 


Merging  merging  a  node  y  into  a  node  x  (written  Merge(y,x))  in  G  = 
(V,  E,  L.  f  )  yields  a  graph  that  is  obtained  from  G  as  follows: 

1.  for  all  nodes  2  such  that  (z,  y)  €  E 

a)  if  {(x,  z),  {z,  x}}  H  E  —  (/),  then  add  (z,  x)  to  E  and  set  E((z,  x})  — 

b)  if  (z,  x)  e  E,  then  set  £((z,  x))  =  E((z,  x))  U  E ({z,  y)), 

c)  if  (x,z)  €  E,  then  set  E((x,z))  —  E{(x,z))  U  (lnv(5l)  |  S  e 

and 

d)  remove  (z,y)  from  E; 


Optimisations.tex;  27/04/2006;  9:02;  p.8 


Optimising  Expressive  DLs 


9 


n-rule:  if  1.  Cl  n  C2  6  L  (x),  x  is  not  indirectly  blocked,  and 
2.  {Ci,C2}g£(x) 
then  set  £(x)  =  £(x)  U  {Ci,  C2} 

U-rule:  if  1.  C\  U  C2  G  £(x),  x  is  not  indirectly  blocked,  and 
2.  {Ci,  C2}  Pi  £(x)  =  0 

then  set  £(x)  =  £(x)  U  {C}  for  some  C  6  {Ci,  C2} 

3-rule:  if  1.  3 S.C  6  L(x),  x  is  not  blocked,  and 

2.  x  has  no  safe  S-neighbour  y  with  C  6  £(y), 
then  create  a  new  node  y  with  £((x,  y))  =  {S'} 
and  £(y)  =  {C} 

V-rule:  if  1.  VS.C  6  £(x),  x  is  not  indirectly  blocked,  and 
2.  there  is  an  S-neighbour  y  of  x  with  C  <(  £(y) 
then  set  £(y)  =  £(y)  U  {C} 

V+-rule:  if  1.  VS.C  6  £(x),  x  is  not  indirectly  blocked,  and 

2.  there  is  some  R  with  Trans(f?)  and  R\±S , 

3.  there  is  an  f?.-neighbour  y  of  x  with  Vf?.C  (f  L  (y) 
then  set  £(y)  =  L(y)  U  {Vf?.C} 

?-rule:  if  1.  (^roS.C)  6  £(x),  x  is  not  indirectly  blocked,  and  there 
2.  is  an  S-neighbour  y  of  x  with  {C,  GC}  fl  £(y)  =  0 
then  set  £(y)  =  £(y)  U  {C}  for  some  E  €  {C,  GC} 

^-rule:  if  1.  (^nS.C)  €  £(x),  x  is  not  blocked,  and 

2.  there  are  not  n  safe  S-neighbours  y1; . . . ,  y„  of  x  with 
C  6  £(yj)  and  y;  ^  y^  for  1  <  *  <  j  <  n 
then  create  n  new  nodes  yi, . . . ,  yn  with  £((x,  y;))  =  {S}, 
£(y,:)  =  {C},  and  y*  f-  y3  for  1  <  i  <  j  <  n. 

^-rule:  if  1.  (^nS.C)  €  £(z),  z  is  not  indirectly  blocked,  and 
2.  j}SG(z,  C)  >  n  and  there  are  two  S-neighbours 
x,  y  of  z  with  C  6  £(x)  fl  £(y),  and  not  x^t/ 
then  1.  if  x  is  a  nominal  node,  then  Merge(y,  x) 

2.  else  if  y  is  a  nominal  node  or  an  ancestor  of  x, 
then  Merge(x,  y) 

3.  else  Merge(y,  x) 

C-rule:  if  1.  Ci  C  C2  6  T,  x  is  not  indirectly  blocked,  and 
2.{3Ci,C2}n£(x)  =  0 

then  set  £(x)  =  £(x)  U  {C}  for  some  C  6  {GCi,  C2} 


Figure  1.  Basic  tableaux  expansion  rules  for  SFLOIQ 
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2.  for  all  nominal  nodes  z  such  that  ( y ,  z)  €  E 

a)  if  {(x,  z),  (z,  x}}  H  E  —  (/),  then  add  (x,  z)  to  E  and  set  L((x,  z})  — 

£((y,z))> 

b)  if  ( x ,  z)  €  E,  then  set  £((x,  z))  —  £((x,  z))  U  E((y,  z)), 

c)  if  {z,x)  <E  E,  then  set  £((z,  x))  —  L((z,x})  U  {Inv(S')  |  S  € 
£((y,z))}, and 

d)  remove  (y,  z)  from  E; 

3.  set  £(x)  —  £(x)  U  £> (y); 

4.  add  x  z  for  all  2  such  that  y  ^  2;  and 

5.  Prune(y). 

Two  kinds  of  rule  will  be  of  particular  interest  in  the  following  discussion: 
non-deterministic  rules,  such  as  the  U-rule  and  C-rule,  and  generating  rules, 
such  as  the  3-rule  and  ^-rule.  In  practice,  non-deterministic  rules  are  dealt 
with  by  using  backtracking  search  to  investigate  the  completion  graphs  result¬ 
ing  from  each  possible  expansion.  Applying  such  rules  is,  therefore,  likely  to 
be  more  “costly”,  as  they  either  increase  the  size  of  the  graph  or  increase  the 
size  of  the  search  space. 

O-rule:  if  for  some  o  6  Nj  there  are  2  nodes  x,  y  with 
o  £  £(1)  n  £(y)  and  not  x  ^  y 
then  Merge(x,  y) 

0?-rule:  if  1.  nS.C )  6  L(x),  x  is  a  nominal  node,  and  there  is 
a  blockable  S'-neighbour  y  of  x  such  that  C  6  £(y) 
and  x  is  a  successor  of  y 

2.  there  is  no  m  with  1  ^  m  ^  n,  mS.C )  6  £(x), 

and  there  are  m  nominal  S'-neighbours  21 .... ,  zrn  of 
x  with  C  e  £(z;)  and  y;  jk  y.j  for  all  1  <  i  <  j  <  m. 

then  1.  guess  m  ^  n  and  set  £(x)  =  £(x)  U  {(^ mS.C )} 

2.  create  m  new  nodes  2/1 , . . . ,  ym  with 

L((x,  y,))  =  {5},  &(yi)  =  {C,  oj  for  o;  6  AT/ 
new  in  G,  and  y,;  ^  y.j  for  1  <  i  <  j  <  m. 


Figure  2.  Expansion  rules  dealing  with  nominals 
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3.  Preprocessing  and  simplifications 

The  first  group  of  optimisations  in  FaCT++  are  performed  directly  on  the 
syntax  of  the  input  KB.  These  optimisations  serve  to  preprocess  and  sim¬ 
plify  the  KB  into  a  form  more  amenable  to  later  processing.  As  well  as 
simplifications  such  as  tautology  elimination  and  obvious  clash  detection, 
preprocessing  optimisations  (like  absorption.  Section  3.2.5)  can  also  lead  to 
significant  simplification  of  the  subsequent  reasoning  process. 

Note  that  satisfiability  checking  for  expressive  DLs  requires,  in  the  worst 
case,  time  that  is  at  least  exponential  in  the  size  of  the  input,  whereas  most 
preprocessing  optimisations  have  polynomial,  or  even  linear  worst  case  com¬ 
plexity. 

3.1.  Lexical  normalisation  and  simplification 

Descriptions  of  DL  tableau  algorithms,  such  as  the  one  given  in  Section  2, 
typically  assume  that  the  input  is  in  negation  normal  form  (NNF);  this  sim¬ 
plifies  the  (description  of  the)  algorithm,  but  it  means  that  the  first  (and  most 
common)  kind  of  clash,  i.e.,  {C,  ->(7}  C  L(x)  for  some  node  x  in  the  com¬ 
pletion  graph,  will  only  be  detected  when  C  is  a  named  concept.  For  example, 
when  testing  the  satisfiability  of  the  concept  (An£>)  n-i(AnB),  the  transfor¬ 
mation  into  NNF  would  give  ( A  n  /i )  n  ( -■ /I U  -i  /i ) ;  in  practice  this  means  that, 
in  spite  of  the  “obvious”  contradiction,  backtracking  search  will  be  performed 
in  order  to  determine  that  the  concept  is  unsatisfiable. 

For  this  reason,  practical  algorithms  do  not  transform  the  input  into  NNF, 
but  include  a  >-rule  that  performs  a  single  (negation)  normalisation  step  (e.g., 
applying  the  -i-rule  to  -> ( A  n  B)  e  L(x)  would  cause  -> A  U  to  be  added 
to  L(rr)),  and  the  completion  graph  is  said  to  contain  a  clash  if  it  contains 
a  node  x  with  {C,  ^C}  C  L ( x )  for  an  arbitrary  concept  C.  Moreover,  in 
order  to  facilitate  the  detection  of  such  clashes,  the  input  is  normalised  and 
simplified  so  that  logically  equivalent  concepts  are  more  often  syntactically 
equivalent.  This  is  achieved  by  (recursively)  applying  a  set  of  rewrite  rules 
to  concepts  expressions,  and  by  ordering  conjuncts  w.r.t.  some  total  ordering. 
For  example,  we  re-write  U  and  3  concepts  as  negated  n  and  V  concepts, 
respectively;  we  merge  several  conjunctions  together;  we  order  conjuncts; 
and  we  use  logical  equivalences  and  semantics  preserving  transformations 
in  order  to  simplify  concepts.  These  equivalences  and  transformations  can 
be  split  into  three  groups:  constant  elimination,  syntactic  equivalences  and 
semantic  transformations. 

The  constant  elimination  group  includes  the  following  equivalences: 

(<7  n  T)  =  <7,  (Cnl)  e  _L,  VR.T  =  T. 

The  syntactic  equivalence  group  includes  the  following  equivalences: 

(<7  n  C)  =  <7,  -1-1(7  =  (7,  (7  n  -.(7  =  _L,  >  IR.C  =  3R.C 
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The  semantic  transformation  group  exploits  semantic  information  already 
gathered  during  the  preprocessing  phase  to  simplify  concept  expressions.  For 
example,  if  the  KB  contains  an  axiom  A  O  B  for  named  concepts  A  and  B, 
then  the  concept  expression  An  B  can  be  simplified  to  A.  The  same  is  true 
for  arbitrary  concept  expressions. 

Rewrite  rules  can  be  separated  into  two  classes:  “cheap”  ones  and  “ex¬ 
pensive”  ones.  Cheap  ones  (i.e.,  C  n  1  — >  _L)  are  easily  applied  and  almost 
always  give  a  positive  effect.  They  are,  therefore,  applied  to  every  concept 
expression  appearing  in  the  KB. 

Expensive  rules  require  more  effort  to  recognise  when  they  are  applicable 
(i.e.,  A  n  B  — >  A  when  A  C  B),  and  may  not  give  a  positive  effect  at 
all.  Indeed,  a  well  designed  KB  is  likely  to  include  few  such  constructions, 
so  applying  such  rule  to  the  whole  (possibly  huge)  KB  will  be  a  waste  of 
time.  Flowever,  there  are  special  cases  in  which  it  is  crucial  to  have  concept 
expression  be  as  simple  as  possible.  Such  cases  include  unabsorbed  axioms, 
absorbed  concept  expressions  and  role  ranges  and  domains  (all  of  which  will 
be  explained  below).  In  these  cases,  all  simplification  rules  (including  the 
expensive  ones)  are  applied. 

3.2.  Dealing  with  axioms 

If  dealt  with  naively,  TBox  axioms  can  lead  to  a  serious  degradation  in  reason¬ 
ing  performance,  as  each  such  axiom  would  cause  a  disjunction  to  be  added  to 
every  node  of  the  completion  graph,  leading  to  potentially  enormous  amounts 
on  nondeterministic  expansion  and  backtracking  search. 

For  example,  given  a  TBox  T,  if  T  CieT,  with  A  —  (( C\  U  D\)  n 
. . .  n  ( Cn  U  On)),  and  testing  the  satisfiability  of  T  leads  to  the  construction 
of  a  completion  graph  containing  k  nodes,  then  there  are  2kn  different  ways 
to  apply  the  n-  and  U-rules  to  the  resulting  k  copies  of  A.  This  explosion  in 
the  size  of  the  search  space  can  easily  lead  to  a  catastrophic  degradation  in 
performance,  even  when  optimisations  such  as  backjumping  (see  Section  4.3) 
and  caching  (see  Section  4.6)  are  employed  (Florrocks,  1997). 

Fortunately,  optimisations  known  as  lazy  unfolding  and  absorption  have 
proved  to  be  very  effective  in  reducing  the  size  of  the  search  space. 

3.2.1.  Lazy  Unfolding 

TBox  axioms  are  often  (restricted  to  be)  of  the  form  A  C  C  or  /I  =  C  for 
some  concept  name  A  (where  A  =  C  is  an  abbreviation  for  the  pair  of  GCIs, 
A  O  C  and  ->A  C  ~>C).  Such  axioms  are  often  called  definitional,  as  they 
can  be  thought  of  as  defining  the  meaning  of  A.  A  TBox 

T  —  {A\  =  Ci,...,  At  =  Ci,  A(. |_i  C  Ci+ 1, . . . ,  Ai+m  O  Ci+m} 

is  said  to  be  unfoldable,  if  it  satisfies  the  following  conditions. 
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—  All  axioms  in  T  are  definitional. 

—  Axioms  in  T  are  unique.  I.e.,  for  each  concept  name  A,  T  contains  at 
most  one  axiom  of  the  form  A  =  C  (i.e.,  A,  f  Aj  for  1  ^  i  <  j  ^  £), 
and  if  it  contains  an  axiom  of  the  form  A  =  C,  then  it  does  not  contain 
any  axiom  of  the  form  A  C  C.  (Note  that  an  arbitrary  set  of  axioms 
{A  C  Ci, . . . ,  A  C  Cn }  can  be  combined  into  a  single  axiom  A  C 
Ci  n  . . .  n  Cn). 

—  T  is  acyclic.  I.e.,  there  is  no  axiom  Ai  =  Ci  €  T  such  that  A,  occurs 
either  directly  or  indirectly  in  Ci?  A  concept  name  A  occurs  indirectly 
in  a  concept  expression  C  if  there  is  a  concept  name  A'  such  that  A' 
occurs  directly  in  C,  and  there  is  an  axiom  A'  =  C  €  T  such  that  A 
occurs  either  directly  or  indirectly  in  C' . 

Instead  of  being  dealt  with  using  the  C-rule,  such  a  set  of  axioms  can  be 
lazily  unfolded  during  the  tableau  expansion.  I.e.,  for  an  axiom  A\  C  Ci  € 
T,  if  Ai  is  added  to  H(x)  for  some  node  x,  then  C,  is  also  added  to  A(x),  and 
for  an  axiom  Aj  =  Cj  €  T,  if  Aj  CAf)  is  added  to  H(x)  for  some  node  x, 
then  Cj  (resp.  ->Cj)  is  also  added  to  L(x). 

It  is  obvious  that  an  arbitrary  TBox  T  can  be  divided  into  an  unfoldable 
part  Tv  and  a  general  part  Tg  such  that  Tu  U  Tg  —  T  and  Tu  fl  Tg  =  0. 
The  unfoldable  part  Tu  can  then  be  dealt  with  using  lazy  unfolding  while 
the  general  part  Tg  is  dealt  with  using  the  L-rule.  In  fact  it  has  been  shown 
that  the  definition  of  an  unfoldable  TBox  can  be  extended  somewhat  while 
still  allowing  the  use  of  the  above  lazy  unfolding  technique.  In  particular, 
the  concept  expressions  occurring  on  the  left  hand  side  of  subsumptions  and 
equivalences  can  also  be  negated  named  concepts,  and  the  acyclicity  condi¬ 
tion  can  be  relaxed  by  distinguishing  positive  and  negative  occurrences  of 
named  concepts  in  a  stratified  theory  (Horrocks  and  Tobies,  2000b;  Lutz, 
1999). 

Lazy  unfolding  can  be  viewed  as  a  modification  of  the  tableaux  expansion 
rules,  replacing  the  C-rule  with  two  rules,  one  for  unfoldable  axioms  and  one 
for  general  axioms.  These  two  rules,  the  C, ,-rule  and  the  Cff-rule  are  given  in 
Figure  3. 

A  form  of  lazy  unfolding  can  also  be  used  to  deal  more  efficiently  with  so 
called  range  and  domain  constraints.  These  often  arise  in  KBs  derived  from 
ontologies,  where  it  is  common  to  state,  e.g.,  that  the  role  “drives”  has  domain 
“adult”  and  range  ’’vehicle”,  where  adult  and  vehicle  are  concepts,  and  where 
the  intuitive  meaning  is  that  only  adults  can  drive,  and  that  only  vehicles 
can  be  driven.  This  can  easily  be  expressed  as  STIOIQ  axioms  of  the  form 

3  For  the  purposes  of  lazy  unfolding,  only  cycles  consisting  entirely  of  =  axioms  are 
problematical. 
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C„-rule:  if  1.  C\  6  £(x),  C\  C  C2  6  Tu,  x  is  not  indirectly  blocked,  and 
2.  {C2}  Pi  &(x)  =  0 

then  set  £(*)  =  L(x)  U  {C2} 

C9-rule:  if  1.  C\  C  C2  £  Tg,  x  is  not  indirectly  blocked,  and 
2.  {-Ci,C2}n£(x)  =  0 

then  set  £(x)  =  £(x)  U  {C}  for  some  C  6  {ACi,  (72} 


Figure  3.  Tableaux  expansion  rules  for  unfoldable  and  general  axioms 


R-rule:  if  l.TC  MS.C  6  Tr,  x  is  not  indirectly  blocked,  and 
2.  there  is  an  S'-neighbour  y  of  x  with  C  £(y) 
then  £(j/)  — ■>  L(y)  U  {C} 


Figure  4.  Tableaux  expansion  rule  for  domain  and  range  axioms 


T  C  \/R~ .C  (to  state  that  the  domain  of  R  is  C,  e.g.,  T  C  Vdrives-. adult), 
and  T  C  M II. C  (to  state  that  the  range  of  R  is  C,  e.g.,  T  C  Vdrives. vehicle). 

Such  axioms  are  not  unfoldable,  and  will  therefore  be  dealt  with  by  the 
C-rtile.  For  an  axiom  T  C  MR.C,  this  would  lead  to  ->T  U  M II. C  being 
added  to  the  label  of  each  node.  This  can  clearly  be  simplified  to  MR.C,  so 
there  is  no  need  for  non-deterministic  expansion.  Even  so,  when  there  are 
very  large  numbers  of  range  and  domain  axioms  (which  is  the  case  in  some 
KBs  derived  from  ontologies,  where  it  is  common  practice  to  specify  the 
range  and  domain  of  every  role),  this  may  lead  to  a  significant  degradation  of 
performance  simply  due  to  the  large  size  of  node  labels  (and  using  complex 
data  structures  for  node  labels  is  also  problematical  due  to  the  saving  and 
restoring  that  is  needed  during  backtracking  search).  Extending  the  tableaux 
algorithm  to  lazily  unfold  range  and  domain  axioms  not  only  deals  with  this 
problem,  but  also  provides  additional  opportunities  for  applying  the  important 
absorption  optimisation  (see  Section  3.2.5). 

It  is  obvious  that  an  arbitrary  TBox  T  can  be  divided  into  three  parts:  an 
unfoldable  part  Tu,  a  range  and  domain  part  %,  and  a  general  part  Tg,  such 
that  T  =  7^U7^.U7^,  and  Tu,  Tr  and  Tg  are  pairwise  disjoint.  The  unfoldable 
part  Tu  is  as  before,  the  range  and  domain  part  7)  consists  of  all  the  axioms 
of  the  form  T  C  M II. C  that  would  formerly  have  been  in  Tg  (note  that  R  can 
be  an  inverse  role),  and  Tg  consists  of  the  remaining  general  axioms.  A  new 
tableaux  expansion  rule,  the  R-rule,  is  added  in  order  to  deal  with  7j.;  this 
rule  is  given  in  Figure  4. 
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3.2.2.  Synonym  replacement 

Assume  that  the  only  axiom  with  named  concept  A  on  the  left  hand  side 
is  A  =  B,  where  B  is  also  a  named  concept.  This  means  that  there  exist 
two  names  for  the  same  concept.  This  redundancy  in  the  TBox  may  prevent 
the  application  of  further  optimisations,  and  may  introduce  “fake”  cycles  and 
dependencies  into  the  TBox.  To  avoid  this,  we  apply  synonym  replacement. 

If  A  =  B  6  Tu,  then  A  is  called  a  synonym  of  B,  B  is  called  the  represen¬ 
tative  concept  of  A,  and  the  axiom  A  =  B  is  called  the  synonym  definition 
of  A.  During  synonym  replacement,  all  synonyms  occurring  in  the  TBox  (ex¬ 
cept  in  synonym  definitions)  are  replaced  with  their  representative  concepts. 
After  this,  other  simplifications,  such  as  those  described  in  Section  3.1,  can  be 
applied.  For  example,  the  concept  expression  A  n  -> B,  where  A  is  synonym 
of  B,  would  be  transformed  to  B  n  -> B,  which  can  be  further  simplified  to 

3.2.3.  Told  Cycle  Elimination 

The  idea  of  a  told  subsumer  is  widely  used  in  DL  reasoning,  especially  in 
classification  algorithms  (see  Section  5  below).  Informally,  a  named  concept 
C  has  a  told  subsumer  D  if  C  C  D  is  “obvious”  from  the  syntactic  structure 
of  the  KB.  This  information  is  useful  in  classification,  because  it  provides 
some  initial  information  about  subsumption  relations. 

More  formally,  a  concept  D  appears  in  the  expression  C,  if  either  C  —  D, 
or  C  —  C\  n . . .  n  Cn  and  D  appears  in  the  expression  C,  for  some  1  <  i  <  n. 
A  named  concept  B  is  called  an  immediate  told  subsumer  (ITS)  of  a  named 
concept  A,  written  B  €  ITS  (A),  iff: 

—  AQCeT  or  A  =  CeT  and  B  appears  in  the  expression  C; 

—  AQCcT  or  A  =  CcT,  3R.D  appears  in  the  expression  C, 
T  C  V  In v(R).E  6  %  and  B  appears  in  the  expression  E; 

—  A  Q  C  C  T  or  A  =  C  £  T,  >  nR.D  appears  in  the  expression  C, 
T  C  V  In v(R).E  6  %  and  B  appears  in  the  expression  E. 

Told  subsumer  is  the  transitive  closure  of  ITS. 

A  TBox  T  has  a  definitional  cycle  if,  for  some  A  e  Nc,  A  is  a  told 
subsumer  for  itself.  The  presence  of  definitional  cycles  in  the  TBox  can  lead 
to  several  problems,  and  in  particular  can  cause  problems  for  algorithms  that 
exploit  the  told  subsumer  hierarchy  (i.e.,  the  concept  hierarchy  that  is  implied 
by  told  subsumer  relations).  Definitional  cycles  also  make  some  GCI  simpli¬ 
fications  inapplicable.  These  cycles  are,  however,  often  quite  easy  to  elimi¬ 
nate.  We  assume  that  most  modern  reasoners  include  such  an  optimisation, 
although  we  know  of  no  reference  to  it  in  the  literature. 

Assume  A\ . . .  An  are  named  concepts,  such  that  Al+  \  e  ITS{Ai)  for  all 
1  <  i  <  n,  and  A\  —  An.  In  this  case,  all  A,  are  equivalent.  We  can  chose  A\ 
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as  a  representative  concept,  make  A2, ,  An~  1  synonyms  of  A 1  by  adding 
axioms  Ai  =  A  t  to  Tu,  and  run  the  synonym  replacement  algorithm  on  T. 

3.2.4.  Redundant  subsumption  elimination 

After  applying  told  cycle  elimination,  some  axioms  might  include  redundant 
information  due  to  synonym  replacement.  For  example,  if  A  C  B  n  C  €  T, 
and  after  told  cycle  elimination  B  =  A  was  added  to  Tu,  then  the  axiom 
would  be  transformed  into  A  O  A  n  C,  where  the  /I  in  /I  n  C  is  clearly 
redundant.  Redundant  subsumption  elimination  is  a  simple  optimisation  that 
removes  this  kind  of  redundancy. 

The  optimisation  works  as  follows.  For  every  axiom  A  C  C\  n . . .  n  Cn  € 
Tu,  and  for  1  <  i  ^  n,  if  C,  =  A,  then  C)  is  replaced  with  T ;  the  usual 
syntactic  simplifications  are  then  applied  to  the  axiom. 

Note  that  this  optimisation  is  different  from  told  cycle  elimination  (Sec¬ 
tion  3.2.3),  although  there  are  some  axioms  that  can  be  simplified  using  either 
optimisation;  the  axiom  A  Q  An  would,  for  example,  be  simplified  to 
A  C  -1 B  by  either  optimisation.  Told  cycle  elimination  cannot,  however,  deal 
with  axioms  like  A  O  -1 A  U  C,  and  redundant  subsumption  elimination  is  not 
applicable  if  several  axioms  are  involved  in  a  cycle. 

3.2.5.  Absorption 

Given  the  effectiveness  of  lazy  unfolding  in  dealing  with  the  unfoldable  part 
of  a  TBox  Tu  and  the  range  and  domain  axioms  in  7)  ,  it  makes  sense  to  try  to 
rewrite  the  axioms  in  T  so  that  the  size  of  Tg  can  be  reduced. 

Absorption  is  such  a  rewriting  optimisation  that  tries  to  eliminate  GCIs 
in  Tg  by  absorbing  them  into  concept  definitions  in  Tu  ( concept  absorption ) 
or  domain  axioms  in  Tr  (role  absorption).  This  is  usually  done  by  rewriting 
generals  axiom  in  an  equivalent  form  suitable  for  one  of  these  absorptions:  for 
concept  absorption,  the  axiom  should  be  of  the  form  CN  C  D,  where  CN 
is  a  named  primitive  concept  and  D  is  an  arbitrary  concept  expression  (Hor- 
rocks,  2003);  for  role  absorption,  the  axiom  should  be  of  the  form  3i?.T  C  D, 
where  D  is  an  arbitrary  concept  expression  (Tsarkov  and  Florrocks,  2004).  In 
addition,  a  special  form  of  concept  absorption,  called  nominal  absorption,  can 
be  employed  when  an  axiom  has  form  U. .  .1  on  C  I),  or  3R.o  C  D,  where 
o,  01 , . . . ,  on  are  nominals  and  D  is  an  arbitrary  concept  expression  (Sirin 
et  al.,  2005a). 

Given  a  TBox  T,  absorption  proceeds  as  follows.  First,  Tu,  Tr  and  Tg 
are  initialised  such  that  Tr  =  l,  Tu  U  Tg  =  T  and  Tu  is  unfoldable.  This 
can  be  trivially  achieved  by  setting  Tg  =  T  and  Tu  —  %  =  0,  but  it  is 
usually  best  to  try  to  maximise  the  number  of  definitional  axioms  in  Tu,  and  in 
particular  to  maximise  the  number  of  definitional  axioms  of  the  form  A  =  D 
in  Tu  (Horrocks  and  Tobies,  2000b).  Due  to  the  uniqueness  and  acyclicity 
restrictions,  however,  there  may  be  no  unique  maximal  Tu. 
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Next,  the  axioms  in  Tg  are  normalised  by  rewriting  them  as  semantically 
equivalent  axioms  of  the  form  T  C  C,  i.e.,  A  jZ  /i  is  rewritten  as  T  C 
C,  where  C  —  (B  U  - ^ A ) .  The  concepts  occurring  in  such  axioms  can  be 
simplified  using  the  techniques  described  in  Section  3.1,  and  trivial  axioms 
can  be  dealt  with  as  follows: 

—  T  CTis  trivially  satisfiable  and  can  be  removed  from  the  TBox. 

—  T  C  _L  is  trivially  unsatishable  and  leads  directly  to  TBox  unsatishabil- 
ity. 

Finally,  a  range  of  rewriting  rules  can  be  applied  to  axioms  in  Tg  in  order 
to  transform  them  into  a  suitable  form,  and  then  add  them  to  either  Tu  or 
These  rules  are  repeatedly  applied  until  either  Tg  is  empty  or  no  further 
rules  are  applicable.  Note  that  it  is  important  to  first  eliminate  told  cycles,  as 
described  in  Section  3.2.3,  otherwise  application  of  the  rewriting  rules  may 
not  terminate. 

Axiom  transformation  rules: 

—  T  C  B  U  C,  where  B  is  a  named  concept  with  B  =  D  6  Tu,  can 
be  rewritten  as  T  C  D  U  C. 

—  TC  —if?  U  C,  where  B  is  a  named  concept  with  B  =  D  6  Tu,  can 
be  rewritten  as  T  C  ->  D  U  C. 

—  TC  (D\  n  1)2)  U  C  can  be  rewritten  as  two  axioms  T  C  D\  U  C 
and  T  C  D2  U  C. 

Concept  absorption: 

—  TC  -1 A  U  C,  where  A  is  a  named  concept  with  A  C  D  6  Tu,  can 
be  absorbed  into  Tu  by  removing  T  C  -  t  A  U  C  from  Tg  and  adding 
.1:  (AoI„. 

Role  absorption: 

—  TC  (Vff.C)  U  D  can  be  absorbed  into  %.  by  removing  it  from  Tg 
and  adding  T  C  V  Inv(ff). ((Vff.C)  U  D )  to  Tr. 

—  T  C  (<  nff.C)  U  D  can  be  absorbed  into  Tr  by  removing  it  from 
Tg  and  adding  T  C  V  lnv(ff).((<  nff.C)  U  D )  to  Tr. 

Nominal  absorption: 

—  T  C  C,  where  C  —  -(^i  U . on)  U  D,  can  be  absorbed  into 

Tv  by  removing  TCC  from  Tg  and  adding  o,  L  C  to  7),  for  all 

1  <  i  <  n. 
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—  TCC,  where  C  —  -i(3i?.(oi  U  . . .  U  on))  U  D,  can  be  absorbed 
into  Tu  by  removing  T  C  C  from  Tg  and  adding  o,  C  V lnv(i?).C 
to  Tu  for  some  1  <  i  <  n. 

Note  that  there  may  be  many  different  ways  to  apply  these  rewriting  rules, 
some  of  which  may  eventually  lead  to  different  absorptions.  Defining  what 
constitutes  a  “good”  absorption  is  still  an  open  problem,  but  an  absorption 
that  leaves  Tg  empty  is  invariably  better  in  practice  than  one  which  does 
not  (Horrocks  and  Tobies,  2000a;  Tsarkov  and  Horrocks,  2004). 


4.  Optimisations  in  Core  Satisfiability  Testing 

The  core  of  FaCT++  is  its  KB  satisbability  algorithm,  which  implements  a 
highly  optimised  version  of  the  tableaux  algorithm  described  in  Section  2.2. 
But  before  introducing  used  optimisations,  we  remind  in  short  “standard” 
implementation  on  the  tableaux  algorithm  for  testing  concept  satisfiability. 

4.1.  Standard  Satisfiability  Algorithm 

It  is  easy  to  prove  termination  and  correctness  of  the  tableaux  algorithm  pre¬ 
sented  in  Section  2.2,  but  practical  applicability  is  limited.  No  practical  ways 
to  deal  with  non-deterministic  rules  are  given.  All  this  lead  us  to  define  “stan¬ 
dard”  satisfiability  testing  algorithm  which  is  based  on  theoretical  description 
from  Section  2.2. 

One  point  worth  to  note  is  that  concept  expressions  does  not  usually  trans¬ 
formed  to  NNF.  Instead,  as  we  described  in  Section  3.1,  algorithms  used 
simplified  normal  form  together  with  an  additional  -i-rule,  which  allows  al¬ 
gorithms  to  find  clashes  faster. 

Real-life  algorithms  usually  keeps  track  of  changes  in  the  completion 
graph  (i.e.,  which  concepts  were  added  to  node  labels,  or  creation  new  edges 
in  a  completion  graph)  and  maintain  some  ordering  of  concepts  that  have  to 
be  expanded.  As  a  the  same  time  several  expansion  rules  might  be  applicable, 
so  some  order  of  applying  rules  may  be  chosen. 

The  other  thing  that  differs  real  algorithm  from  theoretical  description  is 
the  way  it  deals  with  branching  rules. 

For  every  expansion  of  branching  rule  algorithm  creates  a  branching  point , 
in  which  it  saves  state  of  a  reasoning  process  (including  completion  graph, 
chosen  alternative  in  branching  concept,  etc),  apply  chosen  decision  and  con¬ 
tinue  reasoning  process.  If  later  on  clash  occurs,  algorithm  backtracks  to  the 
latest  branching  point,  restores  state  and  apply  next  alternative.  If  all  possible 
choices  from  given  branching  rule  leads  to  clash,  algorithm  returns  to  the 
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previous  branching  point.  If  backtrack  happens  in  the  first  created  branching 
points,  then  building  of  a  completion  graph  fails,  and  concept  is  unsatisfiable. 

So,  there  exists  two  main  sources  of  non-determinism  in  tableaux  algo¬ 
rithm:  choice  of  an  expansion  rule  to  apply,  and  choose  an  alternative  in  a 
branching  rule.  Most  optimisations  that  are  used  in  real  implementations  are 
targeted  these  two. 

4.2.  Ordering  of  Expansion  Rules 


Most  systems  use  (modified)  top-down  approach  for  ordering  expansion 
rules.  In  this  approach  generating  rules  (like  >-rule  and  3-rule)  are  applied 
after  all  other  rules.  This  comes  from  times  where  description  logics  were 
weaker  and  hence  more  tractable.  For  some  subsets  of  STLOIQ ,  like  STLF 
(logic  with  simplified  number  restrictions  and  without  inverse  roles  and  nom- 
inals),  such  strategy  might  be  used  for  creating  algorithms  with  polynomial 
size  (so-called  trace  technique).  In  short,  algorithm  builds  the  only  branch 
of  a  completion  tree  (for  such  logics  completion  graph  is  tree-shaped),  fully 
expand  it  and  then  discard  fully  expanded  sub-tree  in  order  to  reuse  space  for 
the  other  branches. 

Note  that  trace  technique  can  not  be  used  in  presence  of  inverse  roles,  in 
particular  because  concepts  that  involve  inverse  roles  may  propagate  infor¬ 
mation  back  from  fully  expanded  subtrees.  However,  most  of  algorithms  still 
use  the  same  (accordingly  modified)  schema. 

The  FaCT++  system  was  designed  with  the  intention  of  implementing 
DLs  that  include  inverse  roles,  and  of  investigating  new  optimisation  tech¬ 
niques,  including  new  ordering  heuristics. 

Instead  of  the  top-down  approach,  FaCT++  uses  a  ToDo  list  to  control  the 
application  of  the  expansion  rules.  The  basic  idea  behind  this  approach  is  that 
rules  may  become  applicable  whenever  a  concept  is  added  to  a  node  label. 
When  this  happens,  a  note  of  the  node/concept  pair  is  added  to  the  ToDo  list. 
The  ToDo  list  sorts  all  entries  according  to  some  order,  and  gives  access  to 
the  “first”  element  in  the  list.  A  given  tableaux  algorithm  takes  an  entry  from 
the  ToDo  list  and  processes  it  according  to  the  expansion  rule(s)  relevant  to 
the  entry  (if  any).  During  the  expansion  process,  new  concepts  may  be  added 
to  node  labels,  and  hence  entries  may  be  added  to  the  ToDo  list.  The  process 
continues  until  either  a  clash  occurs  or  the  ToDo  list  become  empty. 

In  FaCT++  the  ToDo  list  architecture  is  implemented  as  a  priority  queue. 
It  is  possible  to  set  a  priority  for  each  rule  type,  and  whenever  entry  would  be 
added  to  the  queue,  it  inserts  after  already  existing  entries  with  same  of  higher 
priorities.  This  means  that  if  the  3-rule  (the  generating  rule  that  expands 
existential  restrictions)  has  a  low  priority  (say  0),  and  all  other  rules  have 
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the  same  priority  (say  1),  then  the  expansion  will  be  (modulo  inverse  roles) 
top-down  and  breadth  first. 

This  architecture  has  a  number  of  advantages  when  compared  to  the  top- 
down  approach.  Firstly,  it  is  applicable  to  a  much  wider  range  of  logics, 
including  the  expressive  logics  implemented  in  modern  systems,  because  it 
makes  no  assumptions  about  the  structure  of  the  graph  (in  particular,  whether 
tree-shaped  or  not),  or  the  order  in  which  the  graph  will  be  constructed.  Sec¬ 
ondly,  it  allows  for  the  use  of  more  powerful  heuristics  that  try  to  improve 
typical  case  performance  by  varying  the  global  order  in  which  different  syn¬ 
tactic  structures  are  decomposed;  in  a  top-down  construction,  such  heuristics 
can  only  operate  on  a  local  region  of  the  graph — typically  a  single  node. 

Empirical  analysis  shows  (Tsarkov  and  Florrocks,  2005b)  that  the  best 
ordering  for  the  expansion  rules  application  is  whether  U-rule  has  the  lowest 
priority,  generating  rules  has  second-lowest  priority,  and  then  all  other  rules 
has  the  same  priority  (except  for  o-rule  and  o?-rule,  that  should  have  higher 
priority  in  order  to  ensure  termination  of  the  algorithm). 

4.3.  Dependency-directed  backtracking  (Backjumping) 

Consider,  for  example,  the  following  concept: 

(Ci  u  Di)  n  . . .  n  (c„  u  Dn)  n  3R.(A  nfi)n  VR.^A. 

In  a  classic  top-down  architecture  the  disjunctions  would  be  expanded  before 
the  existential.  So  the  tableaux  algorithm  would  first  expand  n  disjunctions 
and  then  find  a  clash  due  to  the  3— rule.4  In  the  case  of  normal  (unoptimised) 
backtracking,  2"  choices  of  different  disjunctions  expansions  would  be  done 
before  the  concept  would  be  determined  as  unsatisfiable. 

To  avoid  an  exponential  search  during  checking  the  satisfiability  of  C  and 
similar  concepts,  a  more  sophisticated  solution  is  required,  and  can  be  found 
by  adapting  a  form  of  dependency  directed  backtracking  called  backjump¬ 
ing,  which  has  also  been  used,  e.g.,  in  solving  constraint  satisfiability  prob¬ 
lems  (Baker,  1995)  and  (in  a  slightly  different  form)  in  the  HARP  theorem 
prover  (Oppacher  and  Suen,  1988). 

Intuitively,  backjumping  works  by  labelling  each  concept  C  in  the  la¬ 
bel  of  a  node  x  with  a  dependency  set  Depc{x )  indicating  the  branching 
points  (i.e.,  applications  of  the  a  branching  rule)  on  which  it  depends.  In  case 
the  completion  tree  contains  some  node  x  with  {C,  ->(7}  €  £(x),  we  use 
Depc{ x)  and  Dep-,c(x )  to  identify  the  most  recent  branching  point  b  on 
which  either  C  or  ->C  depends.  The  algorithm  can  then  jump  back  to  b  over 

4  If  the  ToDo  list  algorithm  is  used  with  the  priority  of  3  higher  than  priority  of  U,  clash 
would  be  found  after  two  expansion  of  3-rule.  However,  this  example  can  be  easily  modified 
to  have  an  exponential  behaviour  even  in  such  a  case. 
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intervening  branching  points  without  exploring  any  alternative  branches  (non- 
deterministic  choices),  and  make  a  different  non-deterministic  choice  which 
might  not  lead  to  the  same  closure  condition  being  encountered.  In  case  no 
such  h  exists,  the  closure  did  not  depend  on  any  non-deterministic  choice,  and 
the  algorithm  stops. 

4.4.  Optimisations  of  Disjunction  Processing 

4.4. 1 .  Boolean  constraint  propagation  ( BCP) 

As  well  as  the  standard  tableau  expansion  rules  described  in  Section  2,  addi¬ 
tional  inference  rules  can  be  applied  to  the  concept  occurring  in  a  node  label, 
usually  with  the  objective  of  simplifying  them  and  reducing  the  number  of 
U-rule  applications.  The  most  commonly  used  simplification,  often  called 
Boolean  Constraint  Propagation  (BCP)  (Freeman,  1995),  is  derived  from 
SAT  solvers,  where  it  is  usually  used  in  conjunction  with  the  Davis-Putnam 
procedure.  The  basic  idea  is  to  identify  a  disjunction  C\  U  . . .  U  Cn  €  H(x) 
such  that  the  negations  of  all  but  one  of  the  Cj  are  already  elements  of  li(x); 
when  this  is  the  case,  the  disjunction  can  be  deterministically  expanded  by 
adding  the  relevant  Cj  to  L(x).  This  amounts  to  applying  the  following 
inference  rule 

. . . ,  ~<Cn,  C\  U  . . .  U  Cn  U  C 

c 

to  the  concept  in  a  node  label. 

Note  that,  as  with  the  more  sophisticated  search  techniques  described 
above,  careful  consideration  needs  to  be  given  to  the  dependencies  of  con¬ 
cepts  added  by  such  inference  rules  if  they  are  to  be  used  together  with 
backjumping. 

4.4.2.  Semantic  Branching 

Assume  expansion  of  the  following  concept  (for  simplicity,  let  disjunctions 
will  be  expanded  before  an  existential;  assume  also,  that  C  would  be  the  first 
choice  in  all  the  disjunctions): 

(C  U  Di)  n  . . .  n  (C  U  Dn)  n  3 R.VR~.^C. 

On  the  1st  branching  point  C  was  added  to  a  label  of  a  node.  All  other 
disjunctions  became  expanded  (expansion  rule  wouldn’t  run  because  part  of 
a  disjunction  (namely,  C )  contains  in  a  label  of  a  node).  Further  expansion 
lead  to  a  clash,  because  ->(7  would  be  propagated  to  a  node  after  application 
of  3-rule  and  V-rule.  After  backtracking,  l)\  would  be  chosen  from  the  1st 
disjunction.  But  then  C  would  be  chosen  from  the  2nd  disjunction,  and  the 
story  continues.  In  that  case,  3 n  expansion  rules  would  be  applied  before 
clash-free  completion  graph  would  be  obtained. 
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Semantic  Branching  (Giunchiglia  and  Sebastiani,  1996)  allows  algorithm 
to  avoid  such  situation.  During  expansion  ofCUD  concept  in  the  node  x, 
it  first  tries  to  add  C  to  L{x).  If  this  attempt  fails,  next  try  would  be  made 
with  adding  {D,  -> C}  to  L(x).  This  prevents  following  attempts  to  add  the 
(failing)  concept  C  to  the  label  of  x. 

In  an  example  given  in  the  beginning  of  a  section,  after  failed  expansion 
of  C,  algorithm  adds  {-> C,  I )  \ }  to  the  node  label.  All  other  disjunctions  will 
be  expanded  deterministically  using  BCP.  In  order  to  build  completion  graph, 
n  +  4  expansion  rules  would  be  applied. 

4.4.3.  Heuristics  for  Choosing  Expansion  Ordering 

It  is  well  known  that  different  order  of  expanding  non-deterministic  rule 
can  result  in  huge  (up  to  several  orders  of  magnitude)  difference  in  reason¬ 
ing  performance  (Tsarkov  and  Horrocks,  2005b).  Heuristics  can  be  used  to 
choose  a  “good”  order  in  which  to  try  the  different  possible  expansions  of 
such  rules.  In  practise,  this  usually  means  using  heuristics  to  select  the  way 
in  which  the  U-rule  is  applied  to  the  disjunctions  in  a  node  label;  a  heuristic 
function  is  used  to  compute  the  relative  “goodness”  of  candidate  concept. 

When  using  the  Davis-Putnam  technique,  the  well  known  MOMS  heuris¬ 
tic  (Freeman,  1995)  is  often  used  to  select  the  concept  on  which  to  branch;  it 
tries  to  select  concept  that  will  maximise  the  effect  of  BCP  and  so  minimise 
the  number  of  non-deterministic  choices  needed  in  order  to  complete  the 
completion  graph  (Horrocks,  2003).  There  is  little  evidence,  however,  that 
(a  suitably  adapted  form  of)  this  heuristic  is  effective  with  concepts,  and  even 
some  evidence  to  suggest  that  interference  with  the  backjumping  optimisation 
makes  it  counter  productive  (Horrocks,  2003). 

An  alternative  heuristic,  whose  design  was  prompted  by  this  observation, 
tries  to  maximise  the  effect  of  backjumping  by  preferentially  selecting  con¬ 
cepts  with  low  valued  dependencies  (Horrocks,  2003;  Hladik,  2002).  This 
heuristic  has  the  added  advantage  that  it  can  also  be  used  to  select  the  order 
in  which  successor  nodes  are  expanded. 

Usually  several  possible  expansion-ordering  heuristics  can  be  used  to  choose 
the  order  in  which  to  explore  the  different  expansion  choices  offered  by  the 
non-deterministic  U-rule.  This  ordering  can  be  on  the  basis  of  the  size,  max¬ 
imum  quantifier  depth,  or  frequency  of  usage  of  each  of  the  concepts  in  the 
disjunction,  and  the  order  can  be  either  ascending  (smallest  size,  minimum 
depth  and  lowest  frequency  first)  or  descending.  In  order  to  avoid  the  cost  of 
repeatedly  computing  such  values,  all  the  relevant  statistics  for  each  concept 
can  be  gathered  as  the  knowledge  base  is  loaded. 

In  (Sirin  et  ah,  2005a)  learning-based  disjunct  selection  approach  was  pro¬ 
posed.  At  the  very  beginning  all  disjuncts  has  the  same  priority.  Later  on,  after 
clash  occurs  because  of  disjunction  expansion,  the  disjunct  which  was  a  cause 
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of  a  clash  is  penaltised.  During  consequent  expansions,  penaltised  disjuncts 
got  lower  priority,  according  their  penalty.  This  approach  should  work  better 
for  disjunctions  on  nominal  nodes,  because  they  are  expanded  large  amount 
of  time  in  similar  manner,  and  wrong  statically  chosen  ordering  on  them  leads 
to  significant  slowdown. 

4.4.4.  Combined  Expansion  Rule  for  Disjunction 

The  above  optimisations  for  disjunction  processing  can  be  described  as  the 
following  complex  rule  for  disjunctions: 

—  For  the  initial  expansion  of  a  disjunction,  determine  the  type  of  the 
disjunction,  given  by  the  concept  expression  C\  LI ...  U  Cn . : 

1.  If  Ci  €  £  ( x )  for  some  i,  then  there  is  no  need  to  expand  the  dis¬ 
junction. 

2.  Otherwise,  determine  the  set  C'  —  { (j , , . . . ,  C,k }  c  {  C\ , . . . ,  Cn } 
such  that  ~>Cij  f  £(.r)  for  all  j.  All  elements  of  set  C'  could  be 
added  to  a  label. 

3.  Choose  the  (heuristical)  weight  function  Heu{C)  such  that  if  H eu(C)  < 
Heu{C' )  then  C  should  be  chosen  to  expand  earlier  than  C' . 

4.  Initialize  the  set  of  processed  disjuncts  Cp  —  0. 

—  For  the  general  expansion  step: 

1.  Choose  element  c  from  the  ordered  set  C'  such  that  Vc'  6  C'  : 

Heu(c)  <  Heu(c'). 

2.  If  semantic  branching  is  in  use,  add  -> c'  to  C(x)  for  all  c'  6  Cp. 

3.  Set  Cp  =  Cp  U  {C}  and  C'  -  C"\{(7}. 

4.  If  C'  f  0,  then  create  a  branching  point. 

5.  Add  C  to  Z(x). 


4.4.5.  Save/restore  optimisation 

In  order  to  perform  backtracking  a  tableaux  algorithm  must  be  able  to  save 
its  internal  state  before  branching  and  restore  it  in  case  of  failure  detection. 
Besides  other  things,  state  of  the  completion  graph  (including  node  labels, 
edges  labels,  etc)  should  be  saved. 

A  naive  approach  would  be  to  save  everything  after  branching  and  restore 
everything  after  backjumping.  But  in  realistic  applications,  the  completion 
graph  may  contain  hundreds  or  thousands  of  nodes,  and  only  some  of  them 
are  changed  between  branching  decisions. 
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Moreover,  when  using  the  ToDo  List  approach,  branching  operations  are 
grouped  together,  and  so  they  are  often  expanded  in  all  a  row.  In  this  case, 
operations  usually  change  just  a  small  part  of  the  completion  graph  while 
the  rest  of  it  is  unchanged  between  different  branching  points.  All  this  makes 
naive  approach  inefficient  in  realistic  applications. 

4.4.5. 1.  Lazy  saving  The  goal  of  the  lazy  saving  optimisation  is  to  min¬ 
imise  saving  of  the  nodes  of  the  completion  graph.  Instead  of  saving  the 
whole  completion  graph  on  every  branching  point,  lazy  saving  implements 
a  “save-on-demand”  approach. 

In  this  approach  every  node  of  the  completion  graph  contains  an  additional 
field — the  branching  level  of  the  last  change.  Whenever  a  node  would  be 
changed  (this  includes  additions  to  node  label,  adding  new  incoming/outgoing 
edges,  merging  nodes,  changing  blocked  status,  etc.)  the  algorithm  checks  the 
last  saved  level  of  the  node.  If  the  current  branching  was  made  after  the  last 
save  of  the  node,  then  the  node’s  information  is  outdated.  The  algorithm  saves 
the  state  of  the  node,  which  now  can  be  safely  modified. 

In  addition  to  obvious  gains,  restoring  of  completion  graph  is  also  slightly 
simplified.  In  case  of  restoring  from  level  n  back  to  level  m,  if  the  node  wasn't 
changed  since  m,  there  is  nothing  to  do  with  respect  to  restore. 

In  large  completion  graphs  usually  only  a  small  number  of  nodes  is  changed 
at  the  same  branching  level.  This  is  especially  true  in  case  where  priority  of 
OR  operation  is  the  lowest.  So,  avoiding  unnecessary  saving  of  huge  amounts 
of  nodes  leads  to  increasing  performance.  However,  the  check  of  the  necessity 
of  saving  must  be  done  on  every  updating.  For  lazy  saving  to  be  cost-effective 
this  check  must  should  be  cheap.  In  the  KBs  with  a  small  number  of  branching 
concepts  this  check  may  consume  more  time  than  would  be  gained  as  a  result 
of  an  optimisation. 

4. 4. 5. 2.  Lazy  restoring  The  similar  idea,  restore-on-demand,  may  come 
to  one’s  mind.  Instead  of  doing  restoration  once  per  backjumping  for  all 
nodes  of  the  completion  graph,  it  is  possible  to  check  before  accessing  nodes 
whether  it  is  necessary  to  restore  the  state  of  the  node.  If  the  node  appears  to 
be  out  of  date,  it  is  necessary  to  restore  it  before  taking  any  information  from 
the  node. 

However,  this  approach  is  different  from  the  lazy  saving  in  the  following: 

—  Lazy  restoring  is  a  check-on-read  instead  of  a  check-on-write  technique. 
As  there  are  many  more  reads  than  writes  to  a  completion  graph  (V— rule 
application,  blocking  checks  etc),  many  more  resources  would  be  used 
for  these  checks. 

—  In  lazy  saving,  after  backjumping  new  branching  points  can  reuse  old 
labels  of  branching  levels.  This  is  because  there  is  no  one  entry  (after 
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backjumping)  which  uses  labels.  This  is  not  true  in  case  of  lazy  restoring, 
as  there  are  nodes  that  still  use  the  old  branching  labels.  In  some  cases, 
it  might  be  necessary  to  keep  and  (probably)  maintain  the  whole  tree  of 
branching  decisions. 

—  The  number  of  restores  is  usually  just  a  small  fraction  of  the  number  of 
saves.  In  our  experiments  on  realistic  KBs  restores  were  between  0.5  to 
5%  as  frequent  as  saves.  This  indicates  that  lazy  saving  would  provide 
at  least  20  times  more  benefits  than  lazy  restoring. 

4.5.  Combined  generating  rules 

After  creating  a  new  node  or  changing  role  labels  all  universal  restriction  rules 
must  be  re-applied.  However,  it  is  usually  better  to  re-apply  them  immediately 
during  generating  rule  expansion. 

For  example,  assume  concept  3R.C  was  expanded  in  node  x  and  lead  to 
the  creation  of  node  y  with  C  E  <C(y).  As  usually  a  3-rule  is  expanded  later 
than  a  V-rule,  all  V-concepts  appearing  in  £(x)  were  already  expanded.  But 
now  the  new  /(-neighbour  of  x  immediately  appears,  so  all  V-concepts  in 
£(x)  must  be  expanded  again.  Some  work  can  be  saved  if  the  algorithm  can 
ensure  that  all  such  concepts  were  initially  expanded  before  any  3-concept. 
In  this  case,  it  is  enough  to  re-expand  only  those  MS.D  concepts,  for  which 
R  C*  S. 

4.6.  Caching  SAT  status 

4. 6. 0.3.  Summary:  Instead  of  creating  new  node  using  (combined)  gener¬ 
ating  rule,  it  is  possible  to  check  SAT  status  of  new  node  by  trying  to  merge 
(cached)  models  of  concepts  in  new  node.  Is  such  models  can  be  merged,  then 
there  is  no  need  to  expand  corresponding  subtree. 

Assume  no  inverse  roles  and  no  nominals  are  in  KB  and  top-down  tech¬ 
nique  is  used.  I.e.,  The  expansion  rules  are  sorted  with  generating  rules  have 
the  lowest  priority. 

In  this  case,  all  information  from  predecessors  is  added  to  a  node  label 
before  it  is  processed.  This  means  that,  when  a  given  node  has  been  fully 
expanded  (i.e.,  the  expansion  rules  have  been  exhaustively  applied  to  it),  a 
successor  node  y  with  £(y)  =  { C\ , . . . ,  Cn}  can  be  treated  as  an  independent 
problem,  equivalent  to  testing  the  satisfiability  of  C\  13  . . .  n  Cn. 

A  completion  tree  may  contain  many  such  nodes,  and  the  labels  of  nodes 
tend  to  be  quite  similar.  For  some  concepts,  this  may  result  in  the  same  sub¬ 
problem  being  solved  again  and  again.  In  order  to  avoid  this,  it  is  possible 
to  cache  and  re-use  the  results  of  such  sub-problems.  The  usual  technique  is 
to  use  a  hash  table  to  store  the  satisfiability  status  of  node  labels  (i.e.,  sets  of 
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concepts  treated  as  a  conjunction).  Before  applying  any  expansion  rules  to  a 
new  node  x,  the  cache  is  interrogated  to  determine  if  the  satisfiability  status 
of  £(x)  is  already  known.  If  it  is  known,  then  the  result  can  be  used  without 
further  expansion,  i.e.,  £(x)  can  be  treated  as  though  it  were  either  {_L}  (for 
unsatishable)  or  {T }  (for  satisfiable).  If  the  satisfiability  status  of  £(#)  is  not 
known,  then  L(x)  is  added  to  the  cache,  and  its  status  set  to  satisfiable  if  a 
complete  and  open  completion  tree  rooted  in  x  can  be  constructed,  and  to 
unsatishable  otherwise.  The  procedure  of  determining  satisfiability  status  for 
given  set  of  concepts  is  well-known  and  was  described  in  a  great  details  in 
several  papers  (Horrocks,  2003;  Haarslev  and  Moller,  2001a). 

Since  the  satisfiability  of  a  set  of  concepts  L  implies  the  satisfiability  of 
each  subset  of  L,  and  the  unsatisfiability  of  a  set  of  concepts  L  implies  the 
unsatisfiability  of  each  superset  of  L,  this  basic  idea  can  be  extended  to  check 
for  satisfiable  supersets  of  11 ( x )  and  unsatishable  subsets  of  L(x).  However, 
this  requires  a  considerably  more  sophisticated  data  structure  if  cache  op¬ 
erations  are  to  be  efficient  (Hoffmann  and  Koehler,  1999;  Giunchiglia  and 
Tacchella,  2000). 

Apart  from  the  problem  of  the  storage  required  for  the  cache,  another  more 
subtle  disadvantage  of  caching  is  that,  in  the  case  where  the  cache  returns 
"unsatishable”  for  L(x),  there  is  no  information  about  the  cause  of  the  unsat  - 
ishability  that  can  be  used  to  derive  the  dependency  information  required  for 
backjumping.  Backjumping  can  still  be  performed  by  combining  the  depen¬ 
dency  sets  of  all  of  the  concepts  in  L(x),  but  this  is  likely  to  overestimate  the 
set  of  branching  points  on  which  the  unsatishability  depends. 

In  presence  of  inverse  roles  or  nominals,  however,  this  technique  can  not 
be  directly  used.  If  SAT  checking  involves  nominals,  they  can  be  referred 
from  different  nodes  of  completion  graph,  so  new  concepts  may  be  propa¬ 
gated  to  already  cached  node.  In  case  of  inverse  roles,  information  may  be 
propagated  from  the  node  label  to  it  parent,  so  just  checking  SAT  status  of 
the  child  is  not  enough.  Latest  publications  (Yu  Ding,  2005)  propose  some 
directions  to  use  of  cache  in  presence  of  inverse  roles  but  logic  used  there  is 
much  weaker  than  STLOIQ. 


5.  Optimisations  in  Classification 

Classification  is  the  process  of  establishing  partial  order  <  on  the  set  of 
named  concepts  in  KB  w.r.t.  subsumption  relation,  i.e.  C  <  D  -<==>•  C  Q  I). 
This  order  is  called  hierarchy ,  or  taxonomy  of  concepts.  Classification  is 
one  of  the  main  operations,  performing  by  a  reasoner  for  simplify  following 
subsumption  queries. 

Traditionally,  this  partial  order  relation  is  built  iteratively.  It  is  initialised 
with  trivial  relation  A  <  T  and  on  every  iteration  one  new  concept  name  C 
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is  added  to  a  hierarchy.  For  every  concept  name  C  to  be  added  to  hierarchy, 
the  set  of  parents  (i.e.,  already  classified  immediate  subsumers)  and  the  set  of 
children  (i.e.,  already  classified  immediate  subsumees)  is  being  defined.  Sets 
of  parents  and  children  uniquely  identify  place  of  C  in  the  taxonomy. 

Set  of  parents  (children)  is  defined  by  a  procedure  called  top-down  (resp. 
bottom-up)  search.  These  two  procedures  are  very  similar,  so  we  describe 
only  one  of  them  in  details. 

Top-down  search  phase  for  concept  C  involves  breadth-first  search  of 
subsumers  of  C  starting  from  T.  Once  it  is  determined  that  C  Q  D  the  algo¬ 
rithms  search  subsumers  between  children  of  D.  Concepts  that  are  subsumers 
of  C  but  none  of  its  children  is  subsumer  of  C  became  parents  of  C. 

This  algorithm  was  first  described  in  (Baader  et  al.,  1994).  This  paper  also 
adds  some  optimisation  that  allows  significantly  reduce  search  space. 

5. 1 .  Taxonomy  Creation  Order 

Concept  name  C  directly  uses  concept  name  D  if  D  occurs  in  the  definition 
of  C.  The  relation  uses  is  a  transitive  closure  of  directly  uses.  If  C  uses  D  then 
D  comes  before  C  in  so-called  definition  order.  Assume  that  KB  is  classified 
in  definition  order,  i.e.  a  concept  name  is  not  classified  until  all  the  named 
concepts  it  uses  are  classified.  In  (Baader  et  al.,  1994)  it  was  shown,  that  in 
this  case  bottom-up  search  phase  can  be  omitted  for  primitive  concepts  (the 
only  child  for  such  concept  would  be  _L  node).  Easy  to  see  that  it  is  true  for 
SHOXQ  TBox  T  =  Tu  U  Tr  U  Tg  with  Tg  —  0,  i.e.  without  GCIs.  If  Tg  ^  0, 
however,  it  is  not  true.  Assume  TBox  T  —  TuUTg  with 

%  =  {Ci  E  D,  C2  E  D},Tg  =  {T  E  Ci  n  C2}. 

In  this  TBox  concept  D  should  be  classified  equal  to  T,  i.e.  have  T  both 
as  a  parent  and  children.  If  bottom-up  phase  would  be  omitted,  however, 
algorithm  would  be  unable  to  find  T  E  D  subsumption  and  hence  gives 
wrong  results. 

In  (Haarslev  and  Moller,  2001a)  this  approach  was  modified  in  order  to 
allow  to  skip  bottom-up  phase  in  more  cases.  Authors  propose  to  use  directly 
refers  to  relation,  which  is  similar  to  directly  uses,  but  with  references  occur¬ 
ring  in  scope  of  quantifiers  are  not  considered.  Again  refers  to  is  transitive 
closure  of  directly  refers  to,  and  this  relation  induces  a  partial  order  relation 
on  sets  of  concept  names.  Total  order  produced  by  this  relation  is  called 
quasi-definition  order. 

Usage  of  quasi-definition  order  instead  of  definition  order  suppose  that 
subsumption  relation  between  primitive  concepts  can't  be  derived  via  quanti¬ 
fiers.  This  is  not  true  in  presence  of  inverse  roles.  For  the  KB 

C  E  D,D  E  3R.VR-.C 
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quasi-definition  order  will  set  D  <  C,  while  semantically  C  =  D,  i.e.  both 
C  and  I)  should  be  classified  into  the  same  node  of  taxonomy. 

5.2.  Told  Subsumers  and  Disjoints 

Various  optimisations  are  used  in  order  to  minimise  the  number  of  subsump¬ 
tion  tests  needed  in  each  phase.  For  example,  when  adding  a  concept  C  to 
the  hierarchy,  a  top-down  breadth  first  traversal  is  used  that  only  checks  if  D 
subsumes  C  when  it  has  already  been  determined  that  C  is  subsumed  by  all 
the  concepts  in  the  hierarchy  that  subsume  D.  The  structure  of  TBox  axioms 
is  also  used  to  compute  a  set  of  told  subsumers  of  C  (i.e.,  trivially  obvious 
subsumers).  See  Section  3.2.3  for  the  formal  definition  of  a  told  subsumer.  As 
subsumption  is  immediate  for  told  subsumers,  no  tests  need  to  be  performed 
w.r.t.  these  concepts.  In  order  to  maximise  the  benefit  of  this  optimisation,  all 
of  the  told  subsumers  of  a  concept  C  are  classified  before  C  itself  is  classified. 

The  told  subsumer  optimisation  can  be  used  to  approximate  the  position  of 
C  in  the  hierarchy:  all  of  its  told  subsumers  can  be  marked  as  subsumers  of  C. 
The  most  specific  concepts  in  this  set  of  marked  concepts  are  then  candidates 
to  be  parents  of  C.  In  the  standard  algorithm,  however,  it  is  necessary  to  check 
(recursively)  if  the  children  of  these  concepts  are  also  subsumers  of  C.  This 
can  be  costly  in  the  case  where  one  of  the  told  subsumers  has  a  very  large 
number  of  children.  When  it  has  been  determined  for  some  subsumer  D  of  C 
that  none  of  the  children  of  D  subsume  C,  then  we  know  that  I)  is  a  parent 
of  C. 

At  the  end  of  the  top-down  phase  we  will  have  computed  the  set  of  parents 
of  C ;  all  of  the  concepts  in  this  set,  along  with  all  their  super-concepts,  are 
subsumers  of  C;  all  other  concepts  are  non-subsumers  of  C.  The  next  step  is 
to  determine  the  set  of  children  of  C  (as  mentioned  above,  this  step  can  be 
omitted  for  a  primitive  concept  when  concepts  have  been  classified  in  defini¬ 
tional  order  (Baader  et  al.,  1994)).  This  phase  is  very  similar  to  (the  reverse 
of)  the  top-down  one,  and  we  won’t  describe  it  here — interested  readers  can 
refer  to  (Baader  et  al.,  2003)  for  full  details. 

In  addition  to  use  told  subsumers  to  propagate  positive  subsumption  infor¬ 
mation,  it  is  possible  to  use  told  disjoints  to  propagate  negative  subsumption 
information  (Haarslev  and  Moller,  2001a).  Concept  D  is  told  disjoint  with 
concept  C  if  definition  of  C  looks  like  C  C  -<D  n _ Flaving  told  dis¬ 

joint  information,  it  is  possible  to  immediately  state  non-subsumption  (and 
propagate  this  information  if  necessary). 

5.3.  Reducing  Number  of  Subsumption  Tests 

In  some  cases,  it  is  possible  to  replace  expensive  subsumption  test  with  an¬ 
other  (incomplete,  but  cheap)  tests. 
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As  one  of  such  methods  pseudo-model  merging  technique  described  in 
details  in  Section  4.6  can  be  used.  If  models  for  D  and  -> C  can  be  merged, 
then  concept  -> C  n  D  is  satisfiable  and  then  subsumption  C  E  D  does  not 
holds.  Unfortunately,  as  noted  in  Section  4.6,  this  test  can't  be  applied  directly 
in  presence  of  nominals  in  the  TBox. 

Other  methods  were  developed  to  reduce  number  of  checks  while  travers¬ 
ing  taxonomy  during  top-down  or  bottom- up  phase.  As  these  two  phase  are 
very  similar,  we  will  talk  about  optimisations  w.r.t.  the  top-down  one. 

Some  labelling  optimisations  were  proposed  in  (Baader  et  al.,  1994)  in  ad¬ 
dition  to  the  told  subsumer  one.  When  D  is  going  to  be  checked  as  a  subsumer 
of  classifying  concept  C,  it  is  first  checked  whether  all  subsumers  of  D  are 
subsumers  of  C.  The  opposite  one  is  non-subsumption  propagation:  if  C  Q  D 
subsumption  does  not  holds,  so  do  C  E  D'  for  all  D'  that  are  subsumees  of 
D.  These  two  optimisations  does  not  work  well  together;  in  (Baader  et  al., 
1994)  it  was  shown  that  the  former  one  is  more  efficient  in  general  than  the 
latter. 

In  case  of  wide  (and  shallow)  taxonomies,  one  taxonomy  node  (with  label 
D)  may  have  tens  and  hundreds  children  Dj .  If  such  a  node  is  labelled  as  a 
subsumer  of  classifying  concept  C,  then  large  amount  of  subsumption  tests 
C  E  Dj  should  be  made.  In  this  case,  in  (Haarslev  and  Moller,  2001a)  so- 
called  clustering  technique  is  proposed.  For  concepts  I)\  ...  Dq  new  virtual 
concept  D'  =  I)\  U  ...  U  Do  is  inserted  into  taxonomy.  And  instead  of 
checking  C  E  Dj  first  subsumption  D'  Q  C  is  checked.  This  check  is  done 
using  model  merging  technique,  because  model  for  ->  D'  is  just  a  union  of 
negated  names  for  primitive  concepts  D\, . . .  ,Dq.  In  addition,  it  is  possible 
to  have  several  virtual  concepts  associated  with  the  same  parent  node. 

Another  way  to  reduce  number  of  subsumption  tests  is  to  use  notion  of 
completely  defined  concepts,  that  we  are  going  to  describe  in  more  details 
below. 

5.4.  Completely  Defined  Concepts 

Given  a  TBox  T,  a  primitive  concept  C  is  said  to  be  completely  defined  w.r.t. 
T  when,  for  the  definition  C  E  C\  n  . . .  n  Cn  in  T,  it  holds  that: 

1 .  For  all  1  E  i  E  «,  Cj  is  a  primitive  concept. 

2.  (minimality)  There  exist  no  i,j  such  that  1  <  i,  j  <  n  and  Cj  is  a  told 
subsumer  of  C, . 

When  the  TBox  is  obvious  from  the  context  we  will  talk  about  completely 
defined  concepts  without  reference  to  a  TBox. 

If  we  assume  a  cycle-free  TBox  containing  only  CD  concepts  and  no 
GCIs,  then  the  classification  process  is  very  simple.  In  fact,  we  don't  need 
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to  perform  any  subsumption  tests  at  all:  the  position  of  every  concept  in  the 
hierarchy  is  completely  defined  by  its  told  subsumers.  If  concepts  are  pro¬ 
cessed  in  definitional  order,  then  when  a  concept  C  is  classified,  where  C  is 
defined  by  the  axiom  C  C  C\  n  . . .  n  Cn,  the  parents  of  C  are  C\, ,  Cn , 
and  the  only  child  of  C  is  _L.  Note  that  every  concept  in  such  a  taxonomy  is 
satishable,  because  there  is  no  use  of  negation. 

Correctness  of  the  above  method  for  classification  of  a  cycle-free  TBox 
which  contains  only  completely-defined  concepts  and  no  general  axioms  was 
proved  in  (Tsarkov  and  Horrocks,  2005a). 

This  fact  is,  however,  of  very  little  practical  value  due  to  the  very  stringent 
conditions  on  the  structure  of  the  TBox.  In  the  following,  we  will  show  how 
the  basic  technique  can  be  made  more  useful  by  weakening  some  of  these 
conditions. 

5.4.1.  Primitivity 

In  general,  a  CD  concept  should  not  have  non-primitive  concepts  in  its  def¬ 
inition.  This  is  because  when  the  hierarchy  already  includes  non-primitive 
concepts  (which  will  be  the  case  given  definitional  order  classification)  the 
bottom-up  phase  can  not  be  omitted,  and  the  CD  method  could  therefore  lead 
to  incorrect  results.  Assume,  e.g.,  a  TBox 

{C  c  Ci  n  c2  n  c3,  C"  =  Ci  nc2}.  (l) 

Using  the  CD  classification  approach,  C  will  be  classified  under  Cj ,  C-i  and 
C3,  whereas  it  should  be  classified  under  O'  and  C3. 

5.4.2.  Minimality 

Non-minimal  concepts  may  occur  as  a  result  of  badly  designed  KBs,  im¬ 
proper  usage  of  Domain  constraints  and/or  due  to  absorption  of  GCIs.  The 
minimality  check  may,  however,  be  removed  from  the  definition  of  CD  con¬ 
cepts  provided  that  we  check  for  any  non-minimal  concepts  at  classification 
time,  i.e.,  we  check  if  each  C,  in  a  definition  C  C  C\  n  . . .  n  Cn  is  really 
a  parent  of  C  (i.e.,  has  no  children  that  are  subsumers  of  C ).  This  check  is 
relatively  cheap,  and  is  already  implemented  as  part  of  the  standard  classifi¬ 
cation  algorithm  (see  Section  5.4.5),  where  it  is  used  to  check  which  of  the 
told  subsumers  of  a  concept  C  are  possible  parents  of  C. 

5.4.3.  Non-CD  Concepts 

CD-classification  would  be  of  limited  interest  if  its  applicability  were  limited 
to  KBs  consisting  entirely  of  CD  concepts.  This  is  because  most  "interesting” 
KBs,  including  most  KBs  designed  using  DL  based  ontology  languages,  will 
contain  concept  constructors  other  than  conjunction,  and  this  will  lead  to 
some  concepts  being  non-CD;  in  its  current  form  the  CD  approach  would, 
therefore,  be  useless. 
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On  the  other  hand,  almost  all  TBoxes  will  contain  some  CD  concepts.  In 
this  case,  it  may  be  possible  to  split  the  TBox  into  two  parts — a  CD  part  (i.e., 
containing  only  CD  concepts)  and  a  non-CD  part — and  use  the  CD  algorithm 
only  for  the  CD  part. 

Note  that  such  a  split  will  not  introduce  any  problems  if  the  CD  part  of 
the  classification  is  performed  first — in  fact  the  classification  of  the  CD  part 
is  independent  of  the  non-CD  part  of  the  TBox,  because  the  definitions  of  CD 
concepts  only  refer  to  other  CD  concepts. 

In  the  TBox  1  above,  for  example,  concepts  C\ ,  C'2 ,  C3  and  C  will  be  in 
the  CD  part,  and  C'  in  the  non-CD  part.  After  CD-classification  C  will  have  3 
parents,  and  standard  algorithm  then  insert  C'  with  parents  Ci,  C'2  and  child 
C. 

5.4.4.  General  Axioms 

It  is  easy  to  see  that,  in  the  general  case,  the  CD  approach  cannot  be  used  in 
the  presence  of  GCIs.  Consider,  for  example,  a  TBox  T  —  Tu  U  Tg  with 

Tu  =  {C  C  T,A  C  D,B  C  D},Tg  =  {T  C  AUB}. 

In  this  case,  the  CD  algorithm  classifies  C  under  T,  whereas  it  should  be 
classified  under  D.  The  CD  approach  is,  therefore,  applicable  only  if  all  GCIs 
in  the  TBox  are  absorbed  (i.e.,  Tg  —  0)  using  the  techniques  described  in 
Section  3.2.5. 

5.4.5.  Two-stage  Approach  Using  CD 

A  two-stage  CD  classification  algorithm  has  been  implemented  in  our  FaCT++ 
reasoner.  After  preprocessing,  as  described  in  Section  3.1,  the  TBox  is  clas¬ 
sified  using  the  following  procedure: 

1.  If  the  TBox  does  not  contain  any  GCIs,  mark  some  concepts  as  CD. 
Namely,  T  is  marked  as  CD;  a  primitive  concept  C  is  marked  as  CD 
if  it  has  the  definition  C  C  Ci  n  . . .  n  Cn  and  every  Ci  is  marked  CD;  a 
non-primitive  concept  D  is  marked  CD  if  it  has  definition  D  —  C  and  C 
is  marked  CD. 

2.  If  the  TBox  contains  concepts  marked  as  CD,  then  run  the  CD-classiher. 
The  CD  classifier  works  only  on  those  concepts  that  are  marked  CD, 
processing  them  in  definitional  order.  For  each  such  concept  C,  the  steps 
it  performs  are  as  follows: 

a)  If  C  is  a  synonym  of  some  already  classified  concept  D,  then  insert 
C  at  the  same  place  as  D. 

b)  If  C  has  definition  C  C  C\  n  . . .  n  Cn,  concepts  C\, ...  ,Cn  are 
candidates  to  be  parents  of  C. 
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c)  For  every  candidate  Ci,  check  whether  it  is  redundant,  i.e.  whether  Cj 
has  a  child  that  is  an  ancestor  of  a  C.  This  can  be  done  by  labelling 
all  ancestors  of  candidate  concepts:  labelled  candidates  will  be  re¬ 
dundant.  Remove  redundant  candidates  from  the  list  of  candidates. 

d)  Insert  C  into  the  taxonomy  with  the  remaining  candidates  as  parents 
and  ±  as  the  only  child. 

3.  Classify  the  remaining  concepts  using  the  standard  classification  algo¬ 
rithm. 


6.  Discussion 


6. 1 .  Notes  on  Evaluation 

We  does  not  plan  to  put  any  evaluation  on  this  paper.  Most  of  optimisations 
mentioned  here  are  described  in  other  papers,  which  usually  contain  some 
empirical  evaluation.  In  this  section  we  just  summarise  all  described  optimi¬ 
sations  and  mark  them  with  respect  to  their  influence  on  overall  performance 
of  a  DL  reasoner. 

6.1.1.  Preprocessing  Optimisations 

Note  that  all  preprocessing  optimisation  takes  at  most  polynomial  time  to  run, 
while  they  hardly  hurts  performance  of  further  reasoning.  This  makes  them 
good  in  almost  any  cases. 

Lexical  Normalisation.  It  helps  a  lot  in  founding  clashes  earlier,  so  lazy 
unfolding  optimisation  performs  much  better. 

Lexical  Simplifications.  Most  of  them  are  so  simple,  that  takes  literally  no 
time  to  apply.  Some  of  simplifications,  defined  in  Section  3.1  though  are 
more  costly  and  gives  effect  only  in  very  special  cases.  It  makes  sense 
then  to  apply  such  transformation  to  concept  expressions  that  are  known 
to  be  complex  and  are  likely  to  follow  under  this  cases  (like  GCIs  and 
absorbed  concept  definitions). 

Lazy  Unfolding.  Easy  to  see  that  use  of  C-rule  in  realistic  DL  reasoners  is  a 
nonsense  due  to  huge  amount  of  disjunctions  it  introduces.  Thus  there  is 
no  reasons  not  to  use  lazy  unfolding. 

Synonym  Replacement.  This  optimisation  has  linear  complexity  and  works 
well  together  with  told  cycle  elimination  and  some  special  kind  of  KBs 
(e.g.  those  which  were  obtained  by  merging  several  KBs  together). 
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Told  Cycle  Elimination.  While  not  very  useful  by  itself  and  rarely  appli¬ 
cable  in  a  real-life  KBs,  this  optimisation  costs  nothing  (as  told  sub- 
sumers  are  necessary  for  classification)  and  if  applicable,  allows  more 
optimisation  (like  absorption  and  definition-order  classification)  to  run. 

Redundant  Subsumption  Elimination.  Again,  it  is  hard  to  see  any  use  of 
such  an  optimisation  for  a  nornal  KB,  but  it  is  useful  to  run  it  after  told 
cycle  elimination  (and  in  general,  in  presence  of  cycles  of  length  1). 

Absorption.  As  shown  in  all  optimisation  paper  on  this  topic  (see  (Hor- 
rocks  and  Tobies,  2000b;  Tsarkov  and  Horrocks,  2004)),  absorption  is 
crucial  optimisation  for  DL  reasoners.  Is  is  though  hard  to  define  the 
best  absorption  (and  it  is  even  hard  to  say  what  does  it  mean),  but  any 
absorption,  obtained  from  a  TBox  is  better  in  terms  of  classification 
time. 


6. 1 .2.  Core  Satisfiability  Optimisations 

For  expressive  DLs  like  STLOIQ  reasoning  is  intractable.  This  means  that 
reasoning  procedures  have  at  least  exponential  complexity.  The  task  of  op¬ 
timisations,  thus,  is  to  reduce  complexity  of  a  reasoning  of  a  frequent  tasks 
(classes  of  tasks). 

TODO  List  Architecture.  Using  such  an  optimisation  makes  reasoner  more 
flexible.  In  (Tsarkov  and  Horrocks,  2005a)  we  shows  that  in  most  cases 
giving  branching  rules  the  lowest  priority  is  better.  Note  that  the  stan¬ 
dard  top-down  approach  gives  the  lowest  priority  to  generating  rules. 
Moreover,  decision  procedure  for  STLOIQ  requires  some  ordering  on 
expansion  rule  application  in  order  to  ensure  termination  (Horrocks  and 
Sattler,  2005).  This  requirement  is  easily  incorporated  into  TODO  list 
schema. 

Note  that  TODO  list-based  reasoner  (unlike  the  top-down  approach)  can 
not  stay  in  polynomial  space  anymore.  Though,  reasoning  with  inverse 
roles  required  exponential  space,  so  any  reasoner  for  STLOIQ  would 
lost  this  ability. 

BCP.  This  simple  optimisation  might  transform  non-deterministic  operation 
into  a  deterministic  one.  In  any  case,  it  does  not  harm  reasoning  process 
as  it  prevents  reasoner  to  choose  disjuncts  that  immediately  leads  to 
clash. 

Semantic  Branching.  As  mentioned  in  (Horrocks  and  Patel-Schneider,  1999), 
using  this  optimisation  gives  significant  increase  of  performance  on  arti¬ 
ficial  tests.  Our  experiments  with  real  applications  shows  that  using  this 
optimisations  gives  very  small,  but  almost  always  positive  effect. 
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Heuristics  for  Disjunction.  As  shown  in  (Tsarkov  and  Horrocks,  2005a), 
choosing  improper  heuristic  might  lead  to  completely  impractical  rea¬ 
soning  (the  difference  in  some  cases  is  more  then  three  orders  in  mag¬ 
nitude).  Unfortunately,  we  don’t  have  stable  way  of  choosing  proper 
heuristic,  so  it  is  a  part  of  future  research. 

Note  also,  that  some  heuristics  (like  MOMS),  interacts  badly  with  back- 
jumping  (which  is  one  of  the  most  beneficial  optimisation  techniques  in 
SAT  testing). 

Backjumping.  Another  optimisation  that  is  crucial  for  DL  reasoning.  Even 
the  necessarity  to  build  and  carefully  maintains  dependency  sets  is  over¬ 
powered  by  benefits  that  are  obtained  from  this  optimisation. 

Lazy  Saving.  This  optimisation  works  well  in  KBs  that  have  complex  struc¬ 
ture  (and,  hence  large  completion  graphs  appears  during  SAT  tests).  This 
is  also  very  important  if  TBox  contains  large  number  of  nominals,  as 
usually  all  (or  large  part)  of  them  should  always  exists  in  a  completion 
graph  (Sirin  et  ah,  2005a). 

Caching  SAT  Status.  This  optimisation  is  applicable  only  in  absence  of  in¬ 
verse  roles.  But  in  this  case  it  shows  amazing  improvement  in  reasoning, 
significantly  reducing  time  of  building  completion  graph. 


6.1.3.  Classification  Optimisations 

Note  that,  while  all  classification  optimisations  saves  (at  most)  quadratic  amount 

of  necessary  subsumption  tests,  each  test  might  take  exponentially  many  op¬ 
erations  by  itself,  which  lead  to  significant  improvements. 

Taxonomy  Creation  Order. 

Told  Subsumer  and  Disjoints. 

Caching  Subsumptions.  All  these  optimisations  works  well  when  applica¬ 
ble,  reducing  number  of  performed  subsumption  tests  in  the  price  of  (at 
most)  polynomial  actions.  They  should  be  used  every  time,  as  experi¬ 
ments  (Baader  et  al.,  1994;  Haarslev  and  Moller,  2001a)  shows  benefits 
of  them  in  both  artificial  and  natural  KBs. 

Completely  Defined  Classification.  This  optimisation,  as  it  shown  in  (Tsarkov 
and  Horrocks,  2005b),  gives  the  best  results  on  large  KBs  with  very 
simple  structure.  At  the  same  time,  it  does  not  require  any  extra  actions, 
and  does  not  hurts  general  KBs,  as  most  of  them  contains  (large  or  small) 
part  that  can  be  dealt  with  more  optimal  than  using  standard  algorithm. 
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Abstract.  This  is  a  system  description  of  the  Description  Logic  reasoner 
FaCT++  1.0.  The  reasoner  is  modelled  on  the  well-known  FaCT  reasoner,  but 
employs  a  different  system  architecture  and  has  some  new  features  and  optimisa¬ 
tions. 


1  Introduction 

Description  Logics  (DLs)  are  a  family  of  logic  based  knowledge  representation  for¬ 
malisms  [1].  Although  they  have  a  range  of  applications,  they  are  perhaps  best  known  as 
the  basis  for  widely  used  ontology  languages  such  as  OIL,  DAML+OIL  and  OWL  [10]. 

A  key  motivation  for  basing  ontology  languages  on  DLs  is  that  DL  systems  can 
then  be  used  to  provide  computational  services  for  ontology  tools  and  applications  [12, 
13].  The  increasing  use  of  ontologies,  along  with  increases  in  their  size  and  complexity, 
brings  with  it  a  need  for  efficient  DL  reasoners.  Given  the  high  worst  case  complexity 
of  the  satisfiability/subsumption  problem  for  the  DLs  in  question  (at  least  ExpTime- 
complete),  optimisations  that  exploit  the  structure  of  typical  ontologies  are  crucial  to 
the  viability  of  such  reasoners. 

FaCT++  is  a  new  DL  reasoner  designed  as  a  platform  for  experimenting  with  new 
tableaux  algorithms  and  optimisation  techniques.1  It  incorporates  most  of  the  standard 
optimisation  techniques,  including  those  introduced  in  the  FaCT  system  [7],  but  em¬ 
ploys  a  new  “ToDo  list”  architecture  that  is  better  suited  to  more  complex  tableaux 
algorithms  (such  as  those  used  to  reason  with  OWL  ontologies),  and  allows  for  a  wider 
range  of  heuristic  optimisations. 

2  Tableaux  Reasoning  and  Architecture 

Most  modern  DL  systems  are  based  on  tableaux  algorithms.  Such  algorithms  were  first 
introduced  by  Schmidt-SchauB  and  Smolka  [15],  and  subsequently  extended  to  deal 
with  ever  more  expressive  logics  [1].  Many  systems  now  implement  the  SH1Q  DL,  a 
tableaux  algorithm  for  which  was  first  presented  in  [11];  this  logic  is  very  expressive, 
and  corresponds  closely  to  the  OWL  ontology  language.  In  spite  of  the  high  worst 
case  complexity  of  the  satisfiability/subsumption  problem  for  this  logic  (ExpTime- 
complete),  highly  optimised  implementations  have  been  shown  to  work  well  in  many 
realistic  (ontology)  applications  [7]. 

Tableaux  algorithms  work  by  trying  to  construct  a  labelled  graph  (usually  a  tree) 
representation  of  a  model  of  a  concept,  the  structure  of  which  is  determined  by  syntactic 
decomposition  of  the  concept.  The  decomposition  and  construction  is  usually  carried 

1  FaCT++  is  available  at  http :  / / owl .  man .  ac  .  uk/ f  actplusplus. 


out  by  applying  so  called  tableaux  expansion  rules  to  the  concepts  in  node  labels,  with 
one  rule  being  defined  for  each  of  the  syntactic  constructs  in  the  language  (with  the 
exception  of  negation,  which  is  pushed  inwards,  using  re-writings  such  as  de  Morgan’s 
laws,  until  it  applies  only  to  atomic  concepts). 

Most  implementations  use  a  top-down  construction  based  on  the  so-called  trace 
technique  [5],  The  idea  of  this  technique  is  to  apply  generating  rules  (rules  which  add 
new  leaf  nodes  to  the  tree)  with  the  lowest  priority  (i.e.,  only  apply  these  rules  when  no 
other  rules  are  applicable),  and  to  discard  fully  expanded  sub-trees,  so  that  only  a  single 
“trace”  (i.e.,  a  branch  of  the  tree)  is  kept  in  memory  at  any  one  time.  This  allows  (for 
PSpace  logics)  tree  structures  that  may  be  exponential  in  size  (with  respect  to  the  size 
of  the  input  concept)  to  be  delineated  using  only  polynomial  space.  For  the  ExpTime 
logics  implemented  in  modern  systems,  however,  guaranteeing  polynomial  space  usage 
is  no  longer  an  option.  Moreover,  for  logics  that  support  inverse  roles  (such  as  STilQ), 
a  strictly  top  down  approach  is  no  longer  practical  as  constraints  may  be  propagated 
both  “up”  and  “down”  the  edges  in  the  tree. 

In  spite  of  this,  top-down  construction  is  still  widely  used:  it  has  the  advantage 
of  being  very  simple  and  easy  to  implement — a  procedure  that  exhaustively  expands 
a  node  label  can  be  applied  to  the  current  node  and  then,  recursively,  to  each  of  its 
successors.  It  does,  however,  have  some  serious  drawbacks.  In  the  first  place,  for  logics 
with  inverse  roles,  a  strictly  top-down  construction  would  break  down  as  it  relies  on 
the  fact  that  rules  only  ever  add  concepts  to  the  label  of  the  node  to  which  they  are 
applied  or  to  the  label  of  one  of  its  successor  nodes.  In  the  presence  of  inverse  roles, 
expansion  rules  may  also  add  concepts  to  the  labels  of  predecessor  nodes,  which  could 
then  require  further  expansion.  Moreover,  discarding  fully  expanded  sub-trees  may  no 
longer  be  possible,  as  the  expansion  of  a  concept  added  to  the  label  of  a  predecessor 
may  cause  concepts  to  be  added  to  the  label  of  a  sibling  node  that  had  previously  been 
fully  expanded. 

In  the  second  place,  the  top  down  method  forces  non-deterministic  rules  (e.g.,  the 
rule  for  expanding  disjunctions)  to  be  applied  with  a  higher  priority  than  generating 
rules.  As  the  size  of  the  search  space  caused  by  non-deterministic  rule  expansions  is, 
in  practice,  by  far  the  most  serious  problem  for  tableaux  based  systems  [6],  it  may  be 
advantageous  to  apply  non-deterministic  rules  with  the  lowest  priority  [4].  In  fact,  top- 
down  implementations  typically  apply  non-deterministic  rules  with  a  priority  that  is 
lower  than  that  of  all  of  the  other  rules  except  the  generating  rules  [9], 

ToDo  List  Architecture  The  FaCT++  system  was  designed  with  the  intention  of  im¬ 
plementing  DLs  that  include  inverse  roles,  and  of  investigating  new  optimisation  tech¬ 
niques,  including  new  ordering  heuristics.  Currently,  FaCT++  implements  SUIT,  a 
slightly  less  expressive  variant  of  SHIQ  where  the  values  in  cardinality  restrictions 
can  only  be  zero  or  one.2 

Instead  of  the  top-down  approach,  FaCT++  uses  a  To  Do  list  to  control  the  appli¬ 
cation  of  the  expansion  rules.  The  basic  idea  behind  this  approach  is  that  rules  may 
become  applicable  whenever  a  concept  is  added  to  a  node  label.  When  this  happens,  a 
note  of  the  node/concept  pair  is  added  to  the  ToDo  list.  The  ToDo  list  sorts  all  entries 
according  to  some  order,  and  gives  access  to  the  “first”  element  in  the  list.  A  given 
tableaux  algorithm  takes  an  entry  from  the  ToDo  list  and  processes  it  according  to  the 
expansion  rule(s)  relevant  to  the  entry  (if  any).  During  the  expansion  process,  new  con- 


2  SUIT  corresponds  to  the  OWL-Lite  ontology  language  [10]. 


cepts  may  be  added  to  node  labels,  and  hence  entries  may  be  added  to  the  ToDo  list. 
The  process  continues  until  either  a  clash  occurs  or  the  ToDo  list  become  empty. 

This  architecture  has  a  number  of  advantages  when  compared  to  the  top-down  ap¬ 
proach.  Firstly,  it  is  applicable  to  a  much  wider  range  of  logics,  including  the  expres¬ 
sive  logics  implemented  in  modern  systems,  because  it  makes  no  assumptions  about 
the  structure  of  the  graph  (in  particular,  whether  tree  shaped  or  not),  or  the  order  in 
which  the  graph  will  be  constructed.  Secondly,  it  allows  for  the  use  of  more  powerful 
heuristics  that  try  to  improve  typical  case  performance  by  varying  the  global  order  in 
which  different  syntactic  structures  are  decomposed;  in  a  top-down  construction,  such 
heuristics  can  only  operate  on  a  local  region  of  the  graph — typically  a  single  node. 

In  FaCT++  the  ToDo  list  architecture  is  implemented  as  a  set  of  queues  (FIFO 
buffers).  It  is  possible  to  set  a  priority  for  each  rule  type,  and  a  separate  queue  is  im¬ 
plemented  for  each  unique  priority.  Whenever  the  expansion  algorithm  asks  for  a  new 
entry,  it  is  taken  from  the  non-empty  queue  with  the  highest  priority,  and  the  algorithm 
terminates  when  all  the  queues  are  empty.  This  means  that  if  the  3-rule  (the  generating 
rule  that  expands  existential  restrictions)  has  a  low  priority  (say  0),  and  all  other  rules 
have  the  same  priority  (say  1),  then  the  expansion  will  be  (modulo  inverse  roles)  top- 
down  and  breadth  first;  if  stacks  (LIFO  buffers)  were  used  instead  of  queues  with  the 
same  priorities,  then  the  expansion  would  simulate  the  standard  top-down  method. 

3  Optimisations 

Many  of  the  optimisations  used  in  FaCT++  were  taken  from  the  original  FaCT.  These 
include  lexical  normalisation,  boolean  constraint  propagation,  dependency-directed 
backjumping,  caching,  etc.  [1]. 

The  algorithm  implemented  in  FaCT++  is  extended  to  include  native  support  for 
role  range  and  domain  axioms,  which  leads  to  improved  performance  when  reasoning 
with  realistic  ontologies  (which  may  contain  many  such  axioms).  This  extension  can 
be  also  be  exploited  in  order  to  add  a  new  form  of  absorption  optimisation  called  role 
absorption  [16], 

Ordering  Heuristics  As  discussed  in  Section  4,  FaCT++’s  ToDo  list  architecture  al¬ 
lows  for  a  wide  range  of  heuristics  to  be  applied  to  choosing  a  “good”  order  in  which  to 
apply  inference  rules  (we  will  call  these  rule-ordering  heuristics).  Heuristics  can  also  be 
used  to  choose,  for  non-deterministic  rules,  the  order  in  which  to  explore  the  different 
expansion  choices  (we  will  call  these  expansion-ordering  heuristics). 

FaCT++  includes  a  range  of  different  expansion-ordering  heuristics  that  can  be  used 
to  choose  the  order  in  which  to  explore  the  different  expansion  choices  offered  by  the 
non-deterministic  U-rule.  This  ordering  can  be  on  the  basis  of  the  size,  maximum  quan¬ 
tifier  depth,  or  frequency  of  usage  of  each  of  the  concepts  in  the  disjunction,  and  the 
order  can  be  either  ascending  (smallest  size,  minimum  depth  and  lowest  frequency  first) 
or  descending.  In  order  to  avoid  the  cost  of  repeatedly  computing  such  values,  FaCT-f + 
gathers  all  the  relevant  statistics  for  each  concept  as  the  knowledge  base  is  loaded,  and 
caches  them  for  later  use. 

FaCT++  separates  the  classification  process  into  satisfiability  testing  (SAT)  and  sub¬ 
sumption  testing  (SUB)  phases;  the  results  from  the  SAT  phase  are  cached  and  used  to 
speed  up  subsequent  tests  via  a  standard  “model-merging”  optimisation  [9],  It  is  possi¬ 
ble  to  set  different  optimisation  options  for  each  phase  of  the  reasoning  process. 

4  Empirical  Evaluation 

We  have  used  several  real-life  ontologies,  as  well  as  artificial  test  suites,  to  evaluate 
different  optimisations,  and  compare  FaCT-f+’s  performance  with  that  of  other  state- 


of-the-art  DL  reasoners.  Due  to  space  restrictions,  the  results  of  only  a  small  subset  of 
these  tests  can  be  presented  here. 

Ordering  tests  These  tests  illustrate  how  different  expansion  ordering  heuristics  can 
influence  the  reasoner’s  performance.  Several  realistic  KBs  were  used  in  this  test:  Wine- 
Food  is  a  sample  ontology  that  makes  up  part  of  the  OWL  test  suite3  [2];  DOLCE 
is  a  foundational  (top-level)  ontology,  developed  in  the  WonderWeb  project  [3];  and 
GALEN  is  the  anatomical  part  of  the  well-known  medical  terminology  ontology  [14]. 
The  tests  measured  the  time  taken  (in  CPU  seconds)  to  classify  the  ontologies. 


Table  1.  Ontology  classification  times  (seconds)  for  ToDo  (left)  and  OR  (right)  orderings 


KB 

DOLCE 

WineFood 

GALEN 

SAT 

Sa 

Da 

Fa 

Sd 

Dd 

Fd 

SAT 

SUB 

SAT 

SUB 

SAT 

SUB 

SUB 

0.26 

0.29 

0.19 

0.13 

0.13 

0.20 

a 

0.74 

0.74 

0.22 

2.44 

99.44 

1678.11 

Sa 

3.15 

3.57 

3.27 

3.21 

3.21 

3.68 

aO 

0.64 

0.68 

0.14 

1.64 

29.80 

569,  64 

Da 

3.54 

3.57 

3.44 

3.20 

3.40 

3.47 

aEO 

0.58 

0.57 

0.15 

1.67 

9.88 

173.79 

Fa 

3.67 

3.57 

2.32 

2.12 

2.41 

2.35 

aE 

0.60 

0.58 

0.27 

2.87 

13.35 

205.32 

Sd 

1.77 

1.80 

1.71 

1.80 

1.80 

1.83 

aOE 

0.61 

0.59 

0.27 

2.93 

13.22 

201.40 

Dd 

1.69 

1.77 

1.87 

1.66 

1.78 

1.78 

Fd 

2.30 

2.26 

2.75 

3.14 

3.54 

2.76 

In  the  rule-ordering  tests  (left-hand  side  of  Table  1),  each  ordering  strategy  is  shown 
as  a  sequence  of  letters  specifying  the  priorities  (highest  first)  of  the  different  rule  types, 
where  “O”  refers  to  the  U-rule,  “E”  to  the  3-rule,  and  “a”  to  any  other  rule  type.  E.g., 
“aO”  describes  the  strategy  where  the  U-rule  has  the  lowest  priority,  and  all  other  rules 
have  an  equal  higher  priority. 

In  most  cases  the  best  result  is  given  by  the  “aEO”  strategy  (i.e.,  by  assigning  the 
lowest  priority  to  the  U-rule  and  the  next  lowest  priority  to  the  3-rule),  and  even  when 
“aEO”  is  not  the  best  strategy,  the  difference  between  it  and  the  best  strategy  is  very 
small.  Note,  that  such  a  strategy  is  impossible  to  implement  in  a  top-down  approach, 
because  it  is  necessary  to  give  the  lowest  priority  to  the  3-rule. 

Table  1  presents  the  results  of  expansion-ordering  tests  using  the  WineFood  ontol¬ 
ogy.  Each  strategy  is  denoted  by  two  letters,  the  first  of  which  indicates  whether  the 
ordering  is  based  on  concept  size  (“S”),  maximum  depth  (“D”)  or  frequency  of  usage 
(“F”),  and  the  second  of  which  indicates  ascending  (“a”)  or  descending  (“d”)  order. 
When  using  the  best  strategy  (“Sd"  for  SAT  and  “Dd”  for  SUB),  classification  takes 
less  than  half  the  time  taken  when  using  the  worst  strategy. 

InstanceStore  query  answering  The  instanceStore  (iS)  is  a  technique  used  to  deal 
with  ontologies  containing  a  very  large  number  of  individuals.  It  combines  terminolog¬ 
ical  reasoning  with  database  approaches  to  answer  a  limited  form  of  instance-retrieval 
query  against  such  ontologies.  A  more  detailed  description  of  the  iS  can  be  found  in  [8]. 

Here  we  compare  the  performance  of  FaCT  and  FaCT++  in  iS  query  answering. 
The  test  used  the  40,000  concept  Gene  Ontology,  with  roughly  500,000  individuals 
mined  from  the  GO  database  [8].  In  some  cases,  query  answering  may  require  many 
subsumption  tests  to  be  performed  w.r.t.  the  classified  ontology. 

Figure  2  shows  the  results  of  these  tests,  including  the  classification  time  (in  sec¬ 
onds)  for  the  ontology,  the  answer  size  (number  of  instances)  and  the  time  taken  (in 
seconds)  for  each  query.  It  is  easy  to  see  that  FaCT++  is  significantly  better  than  FaCT 

3  This  ontology  therefore  has  a  much  weaker  claim  to  being  “realistic”. 


Table  2.  instanceStore  classification  and  query  answering  times  (seconds) 


Query 

Classification 

Qi 

Q2 

Q3 

Q4 

Q5 

Q6 

Q7 

Q8 

Q9 

Q10 

Qll 

Ans.  size 

— 

6,527 

0 

0 

96,105 

27 

13,449 

11,820 

12 

19 

4,563 

1 

FaCT 

247 

3.1 

37.1 

30.1 

61.2 

0.7 

16.9 

3.2 

87.8 

8.8 

2.1 

2.4 

FaCT++ 

84 

3.9 

10.6 

6.1 

38.5 

0.5 

14.4 

1.9 

31.4 

7.4 

1.3 

0.1 

both  in  classifying  the  (large  but  relatively  simply  structured)  ontology,  and  in  perform¬ 
ing  large  numbers  of  (relatively  easy)  subsumption  tests. 

5  Discussion  and  Future  Directions 

We  have  presented  FaCT++,  an  OWL  Lite  DL  reasoner  which  uses  a  new  ToDo  list 
architecture  and  incorporates  new  optimisations. 

Future  directions  for  FaCT++  include  both  algorithmic  and  technological  improve¬ 
ments.  The  next  version  of  FaCT++  will  support  the  more  expressive  SHOIQ  DL 
needed  by  the  OWL  DL  ontology  language.  Some  new  optimisations,  including  dy¬ 
namic  backjumping  and  more  elaborate  heuristics  are  also  planned.  Regarding  techno¬ 
logical  improvements,  we  plan  to  add  direct  support  for  OWL’s  RDF  syntax,  and  to 
implement  FaCT++  as  a  standalone  HTTP  server. 
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Benchmarking  DL  Reasoners  Using  Realistic 

Ontologies 


Zhengxiang  Pan 

Bell  Labs  Research  and  Lehigh  University 
zhp2@lehigh . edu 


Abstract.  We  did  a  preliminary  benchmark  on  DL  reasoners  using  real 
world  OWL  ontologies.  First  hand  experiences  on  OWL  ontologies  and 
reasoning  services  available  to  OWL  are  described  and  discussed. 


1  Introduction 

We  developed  a  benchmark  to  evaluate  the  performances  of  state-of-the-art  ex¬ 
pressive  Description  Logic  reasoners  using  an  empirical  approach.  The  outcomes 
are  expected  to  illustrate  the  effectiveness  of  DL  reasoners  that  vary  in  opti¬ 
mization  techniques  and  implementation  strategies.  Some  observations  on  the 
deployment  of  OWL  language  are  also  made  and  discussed. 

We  conducted  the  benchmark  toward  three  well-known  DL  reasoners:  Racer, 
FaCT+- f  and  Pellet.  The  benchmark  data  consists  of  more  than  one  hundred 
most  deployed  OWL  ontologies  across  many  domains,  from  life  science  [1]  to 
geographic  [2],  from  food  and  wines  [3]  to  stock  exchange  [4]. 

Because  of  the  heterogeneous  interfaces  (DIG  and  HTTP  etc.)  being  used 
in  the  different  systems,  this  benchmark  should  be  regarded  as  more  qualitative 
than  quantitative,  i.e.  precise  time  counting  is  of  little  interests  here.  Instead, 
we  will  try  to  analyze  the  detailed  outputs  and  logs  of  those  reasoning  systems 
and  extract  useful  informations. 

2  Background 

Due  to  the  correspondence  between  OWL  and  description  logics,  reasoning  sup¬ 
port  to  OWL  applications  heavily  relies  on  description  logic  reasoners.  Bench¬ 
marks  and  evaluations  on  DL  reasoners  thus  become  an  important  task  that 
concerns  with  OWL. 

A  number  of  efforts  have  been  made  on  benchmarking  DL  reasoner.  Some 
of  them  take  the  road  of  generating  synthetic  formulas  randomly  [5]  [6]  [7]. 
Although  this  approach  could  lead  to  a  more  comprehensive  benchmark  on  rea¬ 
soners,  the  parameters  of  those  generators  need  to  be  fine  tuned  by  sampling 
the  realistic  KBs.  In  contrast,  using  the  real  world  KBs  as  test  data  would  be 
more  feasible  and  efficient,  the  results  would  be  more  easily  interpreted  and  used 
by  end  users.  Part  of  the  test  in  [8]  was  using  realistic  TBoxes,  but  only  8  KBs 


were  included  at  that  time.  An  analysis  of  a  DAML+01L  ontology  collection 
in  [9]  characterized  and  categorized  the  real-world  ontologies  but  the  depicted 
benchmark  was  not  implemented  and  conducted. 

3  Experiments 

3.1  Target  Systems 

Three  DL  reasoners  were  chosen  to  run  the  benchmark:  Racer,  Pellet  and  FaCT+- |- 
Certainly  there  are  other  DL  reasoners  worth  benchmarking  but  were  not  tested 
due  to  various  constraints. 

According  to  [10],  Racer  implementation  employed  tableaux  calculus  for 
STLTQ  as  well  as  the  following  optimization  techniques:  dependency-directed 
backtracking  and  DPLL-style  semantic  branching,  transformation  of  axioms 
(GCIs),  model  caching  and  model  merging. 

Pellet  is  claimed  to  be  sound  and  complete  on  STLTJ\f(T>)  and  STLON{T>) 
[11].  It  implements  TBox  partitioning,  absorption  and  lazy  unfolding  plus  depen¬ 
dency  directed  backjumping,  semantic  branching  and  early  blocking  strategies. 
It  also  supports  datatype  reasoning  and  uses  some  optimizations  in  Abox  query 
answering. 

Not  quite  similar  to  FaCT,  FaCT++  implemented  a  new  tableaux  decision 
procedure  for  STiOZQ  [12].  Being  under  its  early  stage  of  development,  FaCT+- 1- 
has  very  limited  user  interface  and  no  API  is  available. 

3.2  Test  Data 

Our  test  data  consists  of  135  real  world  OWL  ontologies.  They  were  submitted 
by  the  ontology  authors  and  users  from  different  domains. 

Our  original  plan  was  to  use  an  OWL-aware  crawler  to  crawl  OWL  ontologies 
available  on  the  web.  However,  we  did  not  find  such  a  tool.  Hence  we  turned 
to  finding  a  large  collection  of  OWL  ontologies.  Among  handful  candidates, 
Schemaweb  [13]  which  is  a  RDF  schemas  directory  became  our  choice.  A  java 
program  was  developed  then  to  read  and  identify  those  schemas. 

We  fed  250  urls  indexed  on  Schemaweb  into  our  program  and  we  identified 
135  OWL  ontologies  among  them.  Totally  5897  classes  and  2601  relations  were 
recorded  out  of  these  OWL  ontologies. 

Having  their  subjects  distributed  in  a  broad  range  of  domains,  these  OWL 
ontologies  also  vary  in  size,  constructs  being  used  and  complexity.  Thus,  we 
argue  these  135  OWL  ontologies  largely  represent  the  current  usage  of  OWL 
language  in  practical  despite  that  they  are  just  a  small  portion  of  existing  OWL 
documents. 

3.3  Experiment  Configurations 

The  experiments  were  done  on  a  Linux  box  featuring  an  Intel(R)  Pentium(R)  4 
CPU  at  2.6GHz  and  1  giga  bytes  main  memory. 


For  each  target  system,  script  or  special  handling  program  were  written  to 
direct  the  benchmark.  No  matter  how  it  was  being  executed,  the  benchmark  is 
the  iteration  of  the  following  steps: 

1.  Clear  the  memory  and  cache  in  the  application. 

2.  Read  in  the  next  ontology  in  the  test  set. 

3.  Do  classification. 

A  time  limit  of  one  hour  (3600  seconds)  was  set  for  each  ontology,  meaning 
that  any  processes  regarding  a  particular  ontology  will  be  aborted  if  aggregated 
CPU  time  exceeds  3600  seconds. 

Racer: 

RacerPro  1.8.1  has  recently  released  as  a  commercial  software;  however  in 
this  paper  we  used  the  last  free-for-research  version  Racer  1.7.  A  racer  in¬ 
struction  file  was  created  to  run  the  benchmark.  Each  ontology  corresponds 
to  four  commands  in  that  file.  First  two  commands  (DELETE- ALL- ABOXES) 
and  (DELETE-ALL-TBOXES)  cleaned  up  the  memory.  Then  (OWL-READ- 
DOCUMENT  ”«ri”)  command  asked  Racer  to  read  in  the  specified  ontology.  At 
the  end,  (TBOX-COHERENT-P)  and  (ABOX-CONSISTENT-P)  invoked  the 
classifications  in  the  Reasoner. 

Pellet: 

We  used  the  Pellet  1.1.0  released  on  05/07/2004.  A  script  file  was  created  to 
manage  the  benchmark.  Each  parameterized  execution  of  Pellet  would  read  one 
ontology  and  do  the  classification.  Since  each  ontology  was  processed  by  a  fresh 
start  of  the  Pellet,  no  need  to  clean  the  memory  and  cache  in  this  case. 

FaCT++: 

As  part  of  the  aforementioned  limitations,  FaCT-|— |-  doesn’t  take  OWL  doc¬ 
uments  directly  nor  any  remote  hies.  A  utility  program  digFaCT-| — f  takes  local 
hies  in  DIG  [14].  In  order  to  make  the  benchmark  working,  we  developed  a  java 
program  to  translate  the  OWL  ontologies  into  DIG  format  and  store  them  lo¬ 
cally.  In  the  benchmark  script,  each  execution  of  digFaCT-| — b  was  supplied  two 
parameters.  One  is  a  tell-document  in  DIG  corresponding  to  one  ontology,  the 
other  is  a  simple  query  hie  that  only  ask  if  TOP  is  satishable.  This  simple  query 
was  used  here  to  invoke  the  classihcation  in  FaCT++. 

4  Results 

Racer  finished  the  benchmark  in  about  15  minutes.  It  successfully  made  TBox 
classihcations  on  108  ontologies,  101  of  which  were  found  to  be  consistent.  It 
also  made  successful  consistency  check  on  ABox  for  92  ontologies,  83  of  which 
were  found  to  be  consistent.  For  those  aborted  tasks  and  inconsistent  ontologies, 
Racer  reported  117  errors,  about  one  third  of  which  is  due  to  the  syntax  errors 
or  usages  beyond  the  scope  of  OWL  DL. 

Pellet  had  done  classihcations  on  103  ontologies  within  the  time  limit.  It  spent 
almost  2  hours  (6814  seconds)  on  these  ontologies.  Interestingly,  all  these  finished 
ontologies  were  classihed  to  be  consistent.  However,  Pellet  automatically  did 


somethings  more  than  just  classification.  It  validates  the  species  of  the  ontologies 
and  tries  to  repair  OWL  Full  ontologies  if  they  are  missing  type  triples  [11].  In 
our  benchmark  result,  70  out  of  103  ontologies  were  validated  as  OWL  Full,  23 
and  10  for  DL  and  Lite  respectively. 

Except  timed  out  for  3  ontologies,  FaCT++  had  done  the  remaining  in  nearly 
30  seconds.  Its  log  recorded  that  it  only  spent  2.6  seconds  on  classifications  of 
the  121  ontologies,  which  were  all  successfully  classified.  Note  the  time  of  parsing 
and  I/O  was  not  included,  nor  was  the  time  spent  on  translating  OWL  into  DIG. 
Nevertheless,  this  kind  of  performance  was  very  impressive. 


System 

Consistent 

Inconsistent 

Timed  out 

Aborted 

Racer(Tbox) 

101 

7 

0 

27 

Pellet 

103 

0 

17 

15 

Fact-1— 1- 

121 

0 

3 

11 

Table  1.  The  Results  of  Classification:  Performed  on  135  ontologies 


5  Discussion 

Here  we  summarize  some  interesting  observations  from  our  benchmark  results. 
They  could  potentially  give  us  some  helpful  hints  on  ontology  authoring  as  well 
as  the  design  and  implementation  of  reasoners. 

Firstly,  the  test  data  and  the  output  of  reasoners  gave  us  a  good  chance 
to  characterize  the  current  usage  of  OWL  language.  Based  on  the  result  from 
Pellet,  more  than  70%  of  the  classified  ontologies  are  OWL  Full,  more  than  three 
quarters  of  these  OWL  Full  ones  can  be  validated  as  OWL  DL  just  by  adding 
some  statements,  like  type  triples.  Racer  also  found  out  22  cases  where  transitive 
properties  were  used  in  cardinality  restrictions,  legitimate  only  in  OWL  Full. 

In  addition,  we  used  the  WonderWeb  validator  [15]  to  validate  the  species 
of  the  rest  ontologies.  By  adding  these  up  we  get  figure  1,  showing  that  the 
majority  of  the  test  KBs  are  OWL  Full.  Note  the  ’’unknown”  category  was  for 
those  ontologies  that  caused  errors  on  the  validator. 

We  assume  that  only  few  authors  intended  to  create  an  ontology  in  OWL 
Full,  because  of  the  extreme  difficulties  in  finding  reasoning  support.  Thus,  an 
ontology  editor  with  built-in  validator  and  heuristic  non-DL  finder  is  highly 
desired  and  should  be  widely  adopted.  Some  efforts  has  been  made  toward  this 
direction,  such  as  [16]. 

Secondly,  the  performances  of  the  reasoners  varied  a  lot.  Although  they  are 
not  directly  comparable  due  to  the  different  input  formats  (OWL  v.s.  DIG)  and 
side-functionalities  (species  checking  etc.),  the  results  implies  the  effectiveness  of 
some  optimization  techniques  being  deployed.  Apparently,  FaCT++  completed 
the  most  testing  ontologies  using  the  least  time.  Within  the  scope  of  classifica¬ 
tion,  FaCT++  significantly  pushed  the  baseline  of  DL  reasoners  to  a  new  high. 


Unknown 


7%  I 


DL 

20% 


Full 

66% 


Fig.  1.  Percentages  of  Each  Species  of  OWL  in  Test  Data 


However,  FaCT++  provides  very  limited  services  compare  to  other  rivals.  So 
far  it  only  supports  queries  through  DIG,  which  is  not  very  expressive  in  ABox 
retrieving. 

Pellet,  on  the  other  hand,  done  least  TBox  classifications  with  most  time. 
Unfortunately,  we  have  no  way  to  figure  out  how  much  time  was  actually  spent 
on  those  extra  functionalities  such  as  species  checking.  Figure  2  shows  that  clas¬ 
sification  time  in  Pellet  increased  nearly  exponentially,  no  wonder  17  testing 
ontologies  were  timed  out.  Interestingly,  we  found  out  that  some  of  the  ontolo¬ 
gies  that  Pellet  spent  a  huge  amount  of  time  on  (but  finished)  were  the  ones 
timed  out  or  failed  by  FaCT++.  This  suggests  that  to  some  extent,  Pellet  is 
more  resilient  to  non-trivial  ontologies. 

For  most  of  the  testing  documents,  Racer  was  not  as  fast  as  FaCT+- 1-,  but 
it  never  timed  out.  This  intrigues  a  dilemma  on  the  implementation  strategies: 
give  time  or  give  up?  Guaranteed  termination  is  a  nice  property  but  sometimes 
resilience  is  also  desired.  One  possible  solution  is  to  allow  users  to  customize  the 
time-out  settings  for  each  execution. 

Above  all,  the  performance  of  a  DL  reasoner  is  affected  by  the  following 
factors: 

—  The  quality  of  inputs.  DL  reasoners  are  not  intended  to  perform  on  non-DL 
ontologies.  Reasoner  should  be  able  to  identify  its  capability  on  the  given 
knowledge  base  before  long  deliberations. 

—  The  optimization  techniques.  Other  experiences  [17]  show  that  cyclic  axioms, 
inverse  roles  and  nominals  are  ’’killer”  constructs  for  DL  reasoners.  New 
optimizations  should  target  these  cases. 


Fig.  2.  Performance  degradation  of  Pellet 


—  The  feasibility  for  customization.  Different  applications  have  different  con¬ 
straints  or  preferences  on  speed  and/or  expressiveness.  They  require  different 
side-functionalities  even  just  for  classification. 


6  Conclusion 


We  performed  a  preliminary  benchmark  on  three  state-of-the-art  DL  reasoners: 
Racer,  Pellet  and  FaCT++.  They  vary  from  each  other  in  many  aspects,  even 
in  programming  languages:  Lisp,  Java  and  C+- 1-  respectively.  Real  world  OWL 
ontologies  across  various  domains  were  used  as  test  KBs.  Observations  on  the 
characteristics  of  those  OWL  ontologies  as  well  as  the  performances  of  the  rea¬ 
soners  were  reported  and  discussed.  We  do  not  intend  to  use  the  results  as  direct 
reflection  of  those  systems’  overall  performances,  for  that  this  simple  benchmark 
is  not  systematic  enough  and  only  focus  on  TBox  classification  at  this  time. 

There  are  a  couple  of  future  directions  on  this  work.  Firstly  we  should  iden¬ 
tify  and  study  case  by  case  on  these  non-trivial  KBs,  i.e.  the  ones  timed  out, 
failed  or  spent  a  lot  of  time.  These  will  intrigue  research  directions  on  DL  rea- 
soner  optimizations.  Secondly  we  need  to  formalize  our  benchmark  by  making 
the  targeting  reasoners  more  comparable,  probably  wrapping  them  in  the  same 
API.  Furthermore,  the  detailed  relationship  between  optimization  techniques 
and  performances  should  be  analyzed. 
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1  Introduction 

Reasoning  for  expressive  DLs  implemented  in  state-of-the-art  systems  has  high 
worst  case  complexity.  The  hope/claim  is,  however,  that  these  systems  perform 
well  in  “realistic”  applications.  In  practice,  this  means  in  ontology  applications. 
To  check  the  validity  of  this  claim  it  is  necessary  to  test  the  performance  of  these 
systems  with  (the  widest  possible  range  of)  ontologies  derived  from  applications. 

In  addition,  testing  is  useful  in  order  to  check  the  correctness  of  implementa¬ 
tions.  In  small  examples,  it  may  be  easy  to  check  the  correctness  of  a  system’s 
reasoning.  However,  for  typical  real-world  examples,  manual  checking  is  not  fea¬ 
sible.  In  these  instances,  the  best  (perhaps  the  only)  way  to  check  correctness  is 
often  by  checking  for  consistency  with  the  reasoning  of  other  existing  systems. 

Real-world  ontologies  vary  considerably  in  their  size  and  expressivity.  While 
they  are  all  valuable  test  cases,  it  is  still  important  to  understand  each  ontology’s 
properties  in  order  to  provide  efficient  and  relevant  testing. 

System  developers  find  this  particularly  useful,  as  it  helps  them  to  identify 
weaknesses  in  their  systems  and  to  devise  and  test  new  optimisations.  Finally, 
testing  is  also  useful  for  the  developers  and  users  of  applications  as  they  can 
use  benchmarking  results  to  determine  if  (the  performance  of)  a  DL  reasoner  is 
likely  to  satisfy  their  requirements,  and  if  so  which  reasoner  is  likely  to  perform 
best  in  their  application. 

2  Background  and  Related  Work 

For  the  above  mentioned  reasons,  there  is  extensive  existing  work  on  benchmark¬ 
ing  DL  (as  well  as  modal  logic)  reasoners.  E.g.,  TANCS  comparisons  and  bench- 
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mark  suites  [10],  DL  comparisons  and  benchmark  suite  [1],  work  on  M-SPASS  [8], 
work  on  FaCT  and  DLP  [7,  6],  the  OWL  benchmark  suite  and  test  results,  and 
various  test  results  from  papers  describing  systems  such  as  FaCT++  (http: 

/ /owl .  man .  ac  .  uk/ f  actplusplus) ,  Pellet  (http :  / / www .  mindswap .  org/2003/ 
pellet/),  Racer  [5],  KAON2  (http://kaon2.semanticweb.org/),  Vampire  [12], 
etc. 

Due  to  the  fact  that  relatively  few  (large  and/or  interesting)  ontologies  were 
available,  earlier  tests  often  used  artificially  generated  test  data.  The  Lehigh 
University  Benchmark  [4],  for  example,  used  a  synthetic  ontology  and  randomly 
generated  data  to  test  the  capabilities  of  knowledge  base  systems  using  specific 
weightings  to  compare  systems  on  characteristics  of  interest.  Results  from  such 
tests  are,  however,  of  doubtful  relevance  when  gauging  performance  on  ontolo¬ 
gies.  The  popularity  of  OWL  has  meant  that  many  more  ontologies  are  now 
available,  and  recent  benchmarking  work  has  focused  on  testing  performance 
with  such  ontologies. 

One  such  example  [11]  involved  benchmarking  a  number  of  reasoners  against 
a  broad  range  of  realistic  ontologies.  However,  not  all  reasoners  used  in  that 
comparison  supports  OWL  as  an  input  language,  so  quantitative  comparison  of 
performance  would  have  been  difficult/un-justihed.  From  the  other  hand,  the 
DIG  interface  [2]  is  recognised  as  a  preferred  choice  by  application  developers 
and  thus  is  implemented  into  a  wide  range  of  DL  Reasoners. 

Our  work  builds  on  these  earlier  efforts,  taking  advantage  of  the  DIG  stan¬ 
dard  to  provide  a  generic  benchmarking  suite  that  allows  the  automatic  quan¬ 
titative  testing  and  comparison  of  DL  Reasoners  on  real-world  examples  with 
relevant  properties.  We  aim  to  make  the  testing  process  as  autonomous  as  possi¬ 
ble,  taking  care,  for  example,  of  (re)starting  and  stopping  reasoners  as  necessary, 
and  the  analysis  of  results  be  as  flexible  as  possible,  by  allowing  for  arbitrary 
SQL  queries  against  the  collected  data.  We  also  aim  to  provide,  as  a  publicly 
available  resource,  a  library  of  test  ontologies  where  each  ontology  has  been 
checked  for  expressivity  and  syntactic  conformance,  translated  into  DIG  syntax 
(which  is  much  easier  to  work  with  that  OWL’s  RDF/XML  syntax),  and  in¬ 
cludes  (where  possible)  results  (such  as  the  concept  hierarchy)  that  can  be  used 
for  testing  the  correctness  of  reasoning  systems. 


3  Methodology 

The  system  has  two  main  functions.  The  first  is  to  process  ontologies  and  add 
them  to  the  library,  and  the  second  is  to  benchmark  one  or  more  reasoners  using 
the  ontology  library. 

When  processing  ontologies,  the  system  takes  as  input  a  list  of  OWL-ontology 
URI’s.  Before  they  can  be  used  in  testing,  some  preprocessing  of  these  ontologies 
is  required.  The  process  involves  generating  valuable  meta-data  about  each 


ontology,  as  well  as  converting  each  of  the  OWL-ontologies  to  DIG. 

The  meta-data  is  generated  by  code  written  for  SWOOP  [9],  and  provides  the 
details  of  the  expressivity  (i.e.  the  constructs  present  in  the  ontology)  together 
with  the  number  of  classes,  object  properties,  data  properties,  individuals,  class 
axioms,  property  axioms  and  individual  axioms  present.  The  OWL-to-DIG  con¬ 
version  uses  the  OWL-API  (http://owlapi.sourceforge.net).  This  process 
is  far  from  trivial  as  OWL’s  RDF  syntax  is  extremely  complex,  and  it  is  easy 
to  (inadvertently)  cause  ontologies  to  be  outside  of  OWL  DL,  e.g.,  by  simply 
forgetting  to  explicitly  type  every  object.  Moreover,  the  DIG  interface  supports 
only  the  most  basic  of  data  types,  such  as  Strings  and  Integers.  The  result  is 
that  many  of  the  available  OWL  Ontologies  we  found  could  not  be  successfully 
converted  to  DIG. 

Local  copies  are  stored  of  both  the  OWL  Ontology  and  the  DIG  version. 
This  is  not  only  for  efficiency  during  the  testing,  but  also  to  ensure  consistency 
(as  online  ontologies  rarely  remain  static).  Moreover,  this  allows  us  to  fix  trivial 
errors  in  the  OWL  ontologies  so  that  they  can  be  used  for  testing  purposes.  The 
locations  of  these  hies,  together  with  their  properties/meta-data,  are  stored  as 
database  entries  for  easy  access  and  manipulation. 

The  main  function  of  the  benchmark  suite  itself  is  timing  the  classification 
of  each  ontology  by  each  Reasoner.  CPU  time  is  the  measure  used  as  it  elimi¬ 
nates  external  factors  such  as  background  threads,  and  to  promote  fairness,  each 
Reasoner  is  terminated  and  then  restarted  for  every  test. 

A  problem  with  trying  to  compare  different  Reasoners  is  that  they  may 
perform  tasks  in  different  ways.  For  example,  they  may  vary  in  the  way  in 
which  they  perform  each  part  of  the  reasoning:  some  may  take  an  “eager” 
approach,  fully  classifying  the  whole  ontology  and  caching  the  results  as  soon 
as  it  is  received;  others  may  take  a  “lazy”  approach,  only  performing  reasoning 
tasks  as  required  in  order  to  answer  queries.  To  try  to  get  around  this  problem, 
we  use  a  five  step  test  that  forces  every  reasoners  to  fully  classify  each  ontology. 
The  tests  are  as  follows: 

1.  TELLS  the  reasoner  the  full  ontology 

2.  ASK  for  all  the  concepts  in  the  ontology 

3.  ASK  for  the  satisfiability  of  the  TOP  concept 

4.  ASK  for  the  satisfiability  of  all  the  concepts  in  the  ontology 

5.  ASK  for  the  ontology  taxonomy  (parents  and  children  of  all  concepts) 

Each  of  these  individual  tests  are  then  timed,  providing  interesting  information 
about  when  different  reasoners  do  most  their  work.  It  is,  however,  the  total  time 
for  this  complete  classification  that  we  are  most  interested  in. 

Each  classification  will  then  end  in  one  of  three  ways.  It  will  either  complete 
successfully,  fail  due  to  lack  of  time  or  fail  for  some  other  reasons.  The  latter 
may  include  fail  due  to  lack  of  run-time  memory,  fail  because  the  reasoner  could 
not  parse  the  ontology  successfully,  etc. 

The  benchmark  suite  is  fully  automatic,  dealing  with  most  errors  autonomously, 


meaning  that  the  testing  can  be  left  to  run  over-night  or  over  a  week-end  (which 
may  be  necessary  when  using  a  large  time-out).  All  data  is  recorded  in  a  MySQL 
database,  making  it  easy  to  the  user  to  view  and  analyse  the  data  in  a  variety 
of  ways. 

As  discussed  in  Section  1,  in  order  to  get  a  clearer  indication  of  how  DL 
Reasoners  perform  in  the  real  world,  we  aim  to  build  a  large  library  of  OWL 
ontologies  from  those  that  are  publicly  available.  Currently,  our  library  contains 
a  little  over  300  OWL-RDF  Ontologies,  but  only  172  of  these  could  successfully 
be  converted  to  DIG.  This  has,  however,  provided  us  with  a  total  of  just  under 
72,000  classes  and  over  30,000  individuals  in  a  DIG  format.  Only  18%  of  the 
ontologies  were  at  least  ALC,  which  suggests  that  the  majority  of  real-world 
ontologies  aren’t  in  fact  very  complex,  but  it  also  means  we  have  a  comfortable 
number  of  “interesting”  examples  too. 


4  Testing 

Our  system  is  currently  fully  automatic  and  runs  the  classification  tests  success¬ 
fully  through  our  whole  library.  It  does  not,  however,  at  this  stage  verify  the 
correctness  of  each  Reasoner’s  answers  to  the  queries  and  how  they  compare  to 
the  answers  given  by  other  Reasoners.  This  means  that  our  measure  of  success 
is,  for  now,  merely  an  indication  that  the  Reasoner  received  and  parsed  the 
DIG  successfully  and  returned  a  valid  DIG  response.  This  is  generally  a  good 
indication,  but  should  only  be  considered  a  preliminary  result. 

We  have  performed  some  tests  on  our  system,  as  it  stands,  and  we  provide 
here  some  examples  of  the  kinds  of  information  that  our  system  can  produce. 

FaCT-t— |-  vl.1.2,  KAON2,  Pellet  v2.2  and  RacerPro  vl.9  are  four  of  the  most 
widely  used  OWL/DIG  reasoners,  and  we  therefore  decided  to  use  these  to  test 
the  current  capabilities  of  our  system.  The  tests  were  performed  using  an  Intel 
Pentium  M  Processor  1.60  GHz  and  1Gb  of  Main  Memory  on  Windows  XP. 
The  time-out  period  was  set  to  one  hour  (in  real  time).  Pellet  and  KAON2  are 
java  applications,  and  for  these  tests  were  run  with  a  maximum  heap  space  of 
200Mb.  RacerPro  and  FaCT+- 1-  were  left  to  run  on  their  default  settings. 

Table  1  shows  how  the  individual  Reasoners  performed  firstly  on  all  our 
ontologies  and  then  on  Ontologies  which  have  particular  characteristics.  Finally, 
it  shows  their  performance  on  OWL-Lite  ontologies,  which  includes  all  those  with 
expressivity  up  to  SHIF. 

In  order  to  determine  which  were  the  most  “challenging”  ontologies  (w.r.t. 
reasoning),  we  tried  to  order  ontologies  according  to  the  difficulty  of  reasoning 
with  them.  To  do  this,  we  used  all  the  ontologies  that  were  successfully  classified 
by  at  least  two  Reasoners  and  then  ordered  these  by  their  average  classification 
time.  Figure  1  shows  the  amount  of  time  each  Reasoner  took  to  classify  the 
10  most  challenging  ontologies  according  to  this  measure  (where  negative  time 
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9 
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9 

3 

0 
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0 
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0 

TransRoles 

Success 

17 

10 

20 

16 
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3 

12 

2 

6 
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TimedOut 

2 

2 

0 

0 

Datatypes 

Success 

90 

21 
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87 
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35 
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4 
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0 

0 
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32 

36 

39 

39 
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5 

4 

0 
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3 

0 

1 

0 

Table  1:  Sample  of  Overall  Performance 
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Figure  1:  Comparison  of  Reasoners  on  the  Top  10  Most  Challenging  Ontologies 
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1 

DLLite 
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http  ://...logy/nciOncology.  owl 

2 

ALR+ 

20526 
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http:/ / archive.godatabase.org/ 
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SHIN 

395 
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http:/ /...gies/tambis-full.owl 

4 

RDFS(DL) 

1108 

3635 

http://... world- fact-book. darnl 
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RDFS(DL) 
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1900 

http:/ /...nt/AirportCodes. darnl 
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SHF 
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ALR+HI(D) 
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35 
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Table  2:  Properties  of  Top  10  Most  Time-consuming  Ontologies 


Reasoner 

Tells 

ConceptList 

SatOfTop 

SatOfClasses 

Hierarchy 

FaCT++ 

6% 

16% 

16% 

11% 

51% 

KAON2 

13% 

9% 

8% 

26% 

44% 

Pellet 

24% 

14% 

12% 

18% 

33% 

RacerPro 

33% 

7% 

11% 

13% 

36% 

Table  3:  Average  Division  of  Task  Time 


represents  a  failure  to  classify).  Please  note  that  the  times  are  given  in  CPU 
time  and  are  considerably  lower  than  the  real  time  taken  for  classification  (as  a 
Windows  machine  will  typically  have  hundreds  of  threads  running  concurrently). 
Table  2  then  shows  some  of  the  interesting  information  that  is  available  on  these 
“Top  10”  Ontologies. 

This  table  is  useful  in  helping  us  understand  what  makes  these  particular 
Ontologies  so  time-consuming  to  reason  over.  In  the  case  of  the  NCI  and  Gene 
Ontology’s  (1st  and  2nd),  it  can  be  clearly  seen  that  it  is  their  shear  size  that 
provides  the  challenge.  The  4th  and  5th  (world-fact-book  and  AirportCodes) 
make  up  for  their  number  of  classes  with  an  extensive  array  of  individuals. 
Whereas  Tambis  (3rd)  simply  uses  some  very  complicated  constructs. 

Our  final  table,  Table  3,  shows  the  average  proportion  of  each  classification 
test  that  each  Reasoner  spent  on  the  separate  tasks.  This  shows,  for  example, 
that  RacerPro  performs  a  lot  of  caching  on  receiving  the  Ontology  (TELLS), 
while  FaCT++  does  nothing  until  the  first  ASK  query. 


5  Discussion 


As  we  mentioned  in  the  introduction,  testing  is  useful  for  reasoner  and  tool 
developers  as  well  as  for  users.  Building  on  existing  work,  we  have  developed 


a  system  for  testing  reasoners  with  available  ontologies.  The  benefits  of  our 
approach  include  autonomous  testing,  flexible  analysis  of  results  and  the  devel¬ 
opment  of  a  test  library  that  should  be  a  valuable  resource  for  both  the  DL 
and  ontology  community.  We  will  continue  to  extend  the  library,  and  will  add 
classification  results  from  tested  reasoners  so  that  correctness  testing  can  also 
be  performed. 

While  there  are  an  increasingly  large  array  of  OWL-Ontologies  available  for 
public  use,  other  Ontology  formats  (e.g.  OBO:  the  Open  Biomedical  Ontologies, 
http :  / / obo  .  sourcef  orge  .  net)  are  still  widely  in  use  and  would  make  for  valu¬ 
able  test  examples.  It  is  also  the  case,  as  describe  in  [3],  that  a  large  proportion 
of  the  available  OWL-Full  Ontologies,  could  in  fact  be  validated  as  OWL-DL, 
just  by  adding  a  few  extra  clarifying  statements.  This  means  that  of  the  162 
Ontologies  that  we  had  to  throw  away,  many  could  be  useful  examples  with  a 
little  work.  In  the  future  we  hope  to  use  these  observations,  together  with  any 
external  contributions,  to  considerably  increase  the  size  of  our  ontology  library. 

The  results  produced  by  our  tests  provide  an  interesting  insight  into  the  vari¬ 
ety  and  depth  of  information  that  can  be  extracted  from  such  testing/benchmarking. 
However,  for  the  system  and  it’s  results  to  become  a  valuable  resource,  we  need 
to  test  their  correctness.  We  are  currently  assuming  that  both  the  OWL-to-DIG 
conversions  and  the  Reasoner’s  responses  are  all  valid  and  correct. 

With  regard  to  the  OWL-API’s  conversions,  this  was  the  utility  built  along¬ 
side  the  original  DIG  specification.  We  therefore  argue  that  this  is  the  best 
conversion  available  and  that  our  assumption  is  justified. 

Regarding  the  responses,  as  discussed  earlier,  they  can  be  almost  impossible 
to  check  for  correctness.  Our  best  option  is  therefore  to  analyse  the  difference  in 
responses  received  from  different  reasoners,  and  this  route  is  thus  one  we  aim  to 
explore  further.  It  will  be  interesting  to  see  if  reasoners  (that  should,  in  theory, 
all  produce  the  same  inferences  to  the  same  problems)  will  actually  agree  on  the 
test  ontologies. 

So  far  we  have  focused  on  testing  Tbox  reasoning  (classification).  Although 
the  use  of  nominals  in  SHOIQ  blurs  the  separation  between  Tbox  and  Abox, 
it  would  still  be  useful  to  explicitly  test  Abox  reasoning,  e.g.,  by  asking  for 
the  instances  of  some  query  class.  This  functionality  will  be  added  in  a  future 
version  of  the  system. 

Apart  from  the  future  work  described  above,  there  are  a  number  of  exten¬ 
sions  to  our  benchmarking  system  that  would  enhance  its  utility.  Allowing  users 
to  define  their  own  customised  test,  rather  than  the  5  step  classification  we  are 
using,  is  one  example  that  would  allow  Reasoner  developers  to  test  specific  op¬ 
timisations  and  implementations  as  they  are  developed.  Other  relevant  tests 
would  include  testing  how  multiple  concurrent  tests  on  a  Reasoner  affects  per¬ 
formance,  as  well  as  simply  not  restarting  a  Reasoner  between  tests. 

We  intend  for  the  whole  system,  including  the  ontology  library,  to  be  available 
for  open-source  use  in  the  near  future. 
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Abstract 

We  describe  an  extension  of  the  description  logic  under¬ 
lying  OWL-DL,  STLOTN,  with  a  number  of  expressive 
means  that  we  believe  will  make  it  more  useful  in  prac¬ 
tice.  Roughly  speaking,  we  extend  STLOTN  with  all 
expressive  means  that  were  suggested  to  us  by  ontology 
developers  as  useful  additions  to  OWL-DL,  and  which, 
additionally,  do  not  affect  its  decidability  and  practica¬ 
bility.  We  consider  complex  role  inclusion  axioms  of 
the  form  RoSE.RorSoRn.Rto  express  prop¬ 
agation  of  one  property  along  another  one,  which  have 
proven  useful  in  medical  terminologies.  Furthermore, 
we  extend  STLOTN  with  reflexive,  antisymmetric,  and 
irreflexive  roles,  disjoint  roles,  a  universal  role,  and  con¬ 
structs  BR.Self,  allowing,  for  instance,  the  definition 
of  concepts  such  as  a  “narcist”.  Finally,  we  consider 
negated  role  assertions  in  Aboxes  and  qualified  number 
restrictions.  The  resulting  logic  is  called  S1ZOTQ. 

We  present  a  rather  elegant  tableau-based  reasoning  al¬ 
gorithm:  it  combines  the  use  of  automata  to  keep  track 
of  universal  value  restrictions  with  the  techniques  de¬ 
veloped  for  STLOTQ.  The  logic  S1ZOTQ  has  been 
adopted  as  the  logical  basis  for  the  next  iteration  of 
OWL,  OWL  1.1. 

Introduction 

We  describe  an  extension,  called  SIZOIQ,  of  the  descrip¬ 
tion  logics  (DLs)  STLOTN  (Horrocks,  Sattler,  &  Tobies, 
1999)  underlying  OWL-DL  (Horrocks,  Patel-Schneider,  & 
van  Harmelen,  2003)1  and  1Z1Q  (Horrocks  &  Sattler,  2004). 
STLOTN  provides  most  expressive  means  that  one  could 
reasonably  expect  from  the  description-logical  basis  of  an 
ontology  language,  and  was  designed  to  constitute  a  good 
compromise  between  expressive  power  and  computational 
complexity/practicability  of  reasoning.  It  lacks,  however, 
e.g.  qualified  number  restrictions  which  are  present  in  the 
DL  considered  here  since  they  are  required  in  various  appli¬ 
cations  (Wolstencroft  el  al.,  2005)  and  do  not  pose  problems 
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*OWL  also  includes  datatypes,  a  simple  form  of  concrete  do¬ 
main  (Baader  &  Hanschke,  1991).  These  can,  however,  be  treated 
exactly  as  in  STLOQ(T))ISTLOQ(T)n)  (Horrocks  &  Sattler,  2001; 
Pan  &  Horrocks,  2003),  so  we  will  not  complicate  our  presentation 
by  considering  them  here. 


(Horrocks  &  Sattler,  2005).  That  is,  we  extend  STLOTQ — 
which  is  STLOTN  with  qualified  number  restrictions — and 
extend  the  work  begun  in  Horrocks,  Kutz,  &  Sattler  (2005). 

Since  OWL-DL  is  becoming  more  widely  used,  it  turns 
out  that  it  lacks  a  number  of  expressive  means  which — 
when  considered  carefully — can  be  added  without  causing 
too  much  difficulties  for  automated  reasoning.  We  extend 
STLOTQ  with  these  expressive  means  and,  although  they 
are  not  completely  independent  in  that  some  of  them  can 
be  expressed  using  others,  first  present  them  together  with 
some  examples.  Recall  that,  in  STLOTQ ,  we  can  already 
state  that  a  role  is  transitive  or  the  subrole  or  the  inverse  of 
another  one  (and  therefore  also  that  it  is  symmetric). 

In  addition,  SIZOTQ  allows  for  the  following: 

1.  disjoint  roles.  Most  DLs  can  be  said  to  be  “unbalanced” 
since  they  allow  to  express  disjointness  on  concepts  but 
not  on  roles,  despite  the  fact  that  role  disjointness  is  quite 
natural  and  can  generate  new  subsumptions  or  inconsis¬ 
tencies  in  the  presence  of  role  hierarchies  and  number  re¬ 
strictions.  E.g.,  the  roles  sister  and  mother  should  be 
declared  as  being  disjoint. 

2.  reflexive,  irreflexive,  and  antisymmetric  roles.  These  fea¬ 
tures  are  of  minor  interest  when  considering  only  TBoxes 
not  using  nominals,  yet  they  add  some  useful  constraints 
if  we  also  refer  to  individuals,  either  by  using  nominals  or 
ABoxes,  especially  in  the  presence  of  number  restrictions. 
E.g.,  the  roles  knows,  hasSibling,  and  properPartOf, 
should  be  declared  as,  respectively,  reflexive,  irreflexive, 
and  antisymmetric. 

3.  negated  role  assertions.  Most  Abox  formalisms  only  al¬ 
low  for  positive  role  assertions  (with  few  exceptions  (Are- 
ces  et  al,  2003;  Baader  et  al.,  2005)),  whereas  S1ZOTQ 
also  allows  for  statements  like  (John,  Mary)  :  ^likes. 
In  the  presence  of  complex  role  inclusions,  negated  role 
assertions  can  be  quite  useful  and,  like  disjoint  roles,  they 
overcome  a  certain  asymmetry  in  expressivity. 

4.  S1ZOTQ  provides  complex  role  inclusion  axioms  of  the 
form  R  o  S  C  R  and  S  o  R  O  R  that  were  first  intro¬ 
duced  in  1ZIQ.  For  example,  w.r.t.  the  axiom  owns  o 
hasPart  C  owns,  and  the  fact  that  each  car  contains  an 
engine  Car  C  ElhasPart. Engine,  an  owner  of  a  car  is 
also  an  owner  of  an  engine,  i.e.,  the  following  subsump¬ 
tion  holds:  Bowns.Car  C  3owns. Engine. 


5.  S1ZOTQ  provides  the  universal  role  U.  Together  with 
nominals  (which  are  also  provided  by  STtOIQ),  this 
role  is  a  prominent  feature  of  hybrid  logics  (Blackburn  & 
Seligman,  1995).  Nominals  can  be  viewed  as  a  powerful 
generalisation  of  ,4  Box  individuals  (Schaerf,  1994;  Hor- 
rocks  &  Sattler,  2001),  and  they  occur  naturally  in  ontolo¬ 
gies,  e.g.,  when  describing  a  class  such  as  EUCountries 
by  enumerating  its  members. 

6.  Finally,  S1ZOTQ  allows  for  concepts  of  the  form  3/?. Self 
which  can  be  used  to  express  “local  reflexivity”  of  a  role 
R ,  e.g.,  to  define  the  concept  “narcist”  as  3likes.Self. 

Besides  a  Tbox  and  an  Abox,  S1ZOTQ  provides  a  so-called 
Rbox  to  gather  all  statements  concerning  roles. 

STZOTQ  is  designed  to  be  of  similar  practicability  as 
SHOTQ.  The  tableau  algorithm  for  STZOTQ  presented 
here  is  essentially  a  combination  of  the  algorithms  for  TZTQ 
and  SHOTQ.  In  particular,  it  employs  the  same  technique 
using  finite  automata  as  in  Horrocks  &  Sattler  (2004)  to  han¬ 
dle  role  inclusions  R  o  S  C  R  and  S  o  R  C  R.  Even 
though  the  additional  expressive  means  require  certain  ad¬ 
justments,  these  adjustments  do  not  add  new  sources  of  non¬ 
determinism  and,  subject  to  empirical  verification,  are  be¬ 
lieved  to  be  “harmless”  in  the  sense  of  not  significantly  de¬ 
grading  typical  performance  as  compared  with  the  SHOTQ 
algorithm.  Moreover,  the  algorithm  for  SIZOIQ  has,  simi¬ 
lar  to  the  one  for  SHOTQ ,  excellent  “pay  as  you  go”  char¬ 
acteristics.  For  instance,  in  case  only  expressive  means  of 
SHTQ  are  used,  the  new  algorithm  will  behave  just  like  the 
algorithm  for  SHTQ. 

We  believe  that  the  combination  of  properties  described 
above  makes  STZOTQ  a  very  useful  basis  for  future  exten¬ 
sions  of  OWL  DL. 

The  Logic  SIZOIQ 

In  this  section,  we  introduce  the  DL  STZOTQ.  This  includes 
the  definition  of  syntax,  semantics,  and  inference  problems. 

Roles,  Role  Hierarchies,  and  Role  Assertions 

Definition  1  Let  C  be  a  set  of  concept  names  including  a 
subset  N  of  nominals,  R  a  set  of  role  names  including  the 
universal  role  U,  and  I  =  {a,  b,c . . .}  a  set  of  individual 
names.  The  set  of  roles  is  R  U  { R~  j  R  £  R},  where  a  role 
R~  is  called  the  inverse  role  of  R. 

As  usual,  an  interpretation  T  =  (Ax, -x)  consists  of  a 
set  Ax,  called  the  domain  of  T,  and  a  valuation  x  which 
associates,  with  each  role  name  R,  a  binary  relation  Rx  C 
Ax  X  Ax  with  the  universal  role  U  the  universal  relation 
Ax  x  Ax  with  each  concept  name  C  a  subset  C'x  C  Ax, 
where  Cx  is  a  singleton  set  if  C  £  N,  and,  with  each  in¬ 
dividual  name  a,  an  element  a x  £  Ax.  Inverse  roles  are 
interpreted  as  usual,  i.e.,  for  each  role  R  £  R  we  have 

(R-)x  =  {(y,x)  I  (x,y)  £  Rx}. 

Obviously,  ( U~)x  =  {U)x .  Note  that,  unlike  in  the  cases 
of  SHTQ  or  SHOTQ,  we  did  not  introduce  transitive  role 
names.  This  is  because,  as  will  become  apparent  below,  role 
box  assertions  can  be  used  to  force  roles  to  be  transitive. 


To  avoid  considering  roles  such  as  R  ,  we  define  a  func¬ 
tion  Inv  on  roles  such  that  Inv(R)  =  R~  if  R  £  R  is  a  role 
name,  and  Inv(R)  =  S  £  R  if  R  =  S~ . 

Since  we  will  often  work  with  finite  strings  of  roles  it  is 
convenient  to  extend  both  x  and  Inv(-)  to  such  strings:  if 
w  =  R\ . . .  Rn  is  a  string  of  roles  f?,;  (1  <  i  <  n),  we  set 
lnv(u>)  =  lnv(i?„) . . .  Inv(ft’i)  and  wx  =  Rx  o  . . .  o  Rx, 
where  o  denotes  composition  of  binary  relations. 

A  role  box  TZ  consists  of  two  components.  The  first  com¬ 
ponent  is  a  role  hierarchy  TZj,  which  consists  of  (gener¬ 
alised)  role  inclusion  axioms.  The  second  component  is  a 
set  7 Za  of  role  assertions  stating,  for  instance,  that  a  role  R 
must  be  interpreted  as  an  irreflexive  relation. 

We  start  with  the  definition  of  a  (regular)  role  hierarchy 
whose  definition  involves  a  certain  ordering  on  roles,  called 
regular.  A  strict  partial  order  -<  on  a  set  A  is  an  irreflexive 
and  transitive  relation  on  A.  A  strict  partial  order  A  on  the 
set  of  roles  R  U  {R~  \  R  £  R}  is  called  a  regular  order 
if  A  satisfies,  additionally,  S  A  It  4=A  S  A  R,  for  all 
roles  R  and  S.  Note,  in  particular,  that  the  irreflexivity  of  A 
ensures  that  neither  S  A  S  nor  S  A  S~  hold. 

Definition  2  ((Regular)  Role  Inclusion  Axioms)  Let  A  be 

a  regular  order  on  roles.  A  role  inclusion  axiom  (RIA  for 
short)  is  an  expression  of  the  form  w  C  R,  where  w  is  a 
finite  string  of  roles  not  including  the  universal  role  U,  and 
R  f  U  is  a  role  name.  A  role  hierarchy  'JZ/,  is  a  finite  set 
ofRIAs.  An  interpretation  T  satisfies  a  role  inclusion  axiom 
w  Q  R,  written  T  \ =  w  Q  R,  if  wx  C  Rx.  An  interpretation 
is  a  model  of  a  role  hierarchy  R/,  if  it  satisfies  all  RIAs  in 
Rh,  written  T  |=  Rh. 

A  RIA  wQRis  A- regular  if  R  is  a  role  name,  and 

1.  w  =  RR,  or 

2.  w  =  R~ ,  or 

3.  w  =  Si . . .  Sn  and  Si  A  R,  for  all  1  <  i  <  n,  or 

4.  w  =  RS\ . . .  Sn  and  Si  A  R,  for  all  1  <  i  <  n,  or 

5.  w  =  Si . . .  SnR  and  Si  A  R,  for  all  1  <  i  <  n. 

Finally,  a  role  hierarchy  TZ^  is  regular  if  there  exists  a  reg¬ 
ular  order  A  such  that  each  RIA  in  TZh  is  -<-regular. 

Regularity  prevents  a  role  hierarchy  from  containing 
cyclic  dependencies.  For  instance,  the  role  hierarchy 

{RS  t  s,  RTt  R,  VT  CT,  TSC  V} 

is  not  regular  because  it  would  require  A  to  satisfy  S  A  V  A 
T  A  Ji  A  S,  which  would  imply  S  A  S,  thus  contradicting 
the  irreflexivity  of  A.  Such  cyclic  dependencies  are  known 
to  lead  to  undecidability  (Horrocks  &  Sattler,  2004). 

Also,  note  that  RIAs  of  the  form  RR~  C  R,  which  would 
imply  (a  weak  form  of)  reflexivity  of  R,  are  not  regular 
according  to  the  definition  of  regular  orderings.  However, 
the  same  condition  on  R  can  be  imposed  by  using  the  GCI 
3R.T  t  3f?.Self;  see  below. 

From  the  definition  of  the  semantics  of  inverse  roles,  it 
follows  immediately  that  (x,y)  £  wx  iff  (y,x)  £  In v(w)x. 
Hence,  each  model  satisfying  w  T  S  also  satisfies  lnv(w)  C 
Inv(S')  (and  vice  versa),  and  thus  the  restriction  to  those 


RIAs  with  only  role  names  on  their  right  hand  side  does  not 
have  any  effect  on  expressivity. 

Given  a  role  hierarchy  7 2/,,  we  define  the  relation  E  to  be 
the  transitive-reflexive  closure  of  E  over  {7?  E  S,  lnv(7?)  E 
Inv(S')  |  R  E  S  £  7 Zh}-  A  role  R  is  called  a  sub-role 
(super-role)  of  a  role  S  if  R  E  S  (S  E  R).  Two  roles  R  and 
S  are  equivalent  (7?  =  S)  if  R,  E  S  and  S  E  7?. 

Note  that,  due  to  restriction  (3)  in  the  definition  of  -<- 
regularity,  we  also  restrict  E  to  be  acyclic,  and  thus  regular 
role  hierarchies  never  contain  two  equivalent  roles.2 

Next,  let  us  turn  to  the  second  component  of  Rboxes,  the 
role  assertions.  For  an  interpretation  X,  we  define  Diag1 
to  be  the  set  {(x,x)  \  x  £  A1}.  Note  that,  since  the 

interpretation  of  the  universal  role  U  is  fixed  in  any  given 
model  (as  the  universal  relation  on  A7  x  A1  which  is,  by 
definition,  reflexive,  symmetric,  and  transitive),  we  disallow 
the  universal  role  to  appear  in  role  assertions. 

Definition  3  (Role  Assertions)  For  roles  R,S  /  U,  we 

call  the  assertions  Ref  (7?),  lrr(7?),  Sym  (7?),  Asy(7?), 
Tra(7?),  and  Dis(7?,  S),  role  assertions,  where,  for  each  in¬ 
terpretation  X  and  all  x,  y,  z  £  A1,  we  have: 

X  \=  Sym (7?)  if  (x,y)  £  R1  implies  (y,x)  £  T?7; 

X  |=  Asy(7?)  if  (x,  y)  £  R1  implies  ( y ,  x)  R1 

X  |=  Tra(7?)  if  (x,  y)  £  R1  and  (y,  z)  £  R1 

imply  (x,  z)  £  R1 ; 

X  |=  Ref(7?)  if  Diag1  C  Rx: 

X  |=  lrr(7?)  if  Rx  3  Diag1  =  0; 

X  |=  Dis (7?,  S)  if  Rx  n  S1  =  0. 

Adding  symmetric  and  transitive  role  assertions  is  a  trivial 
move  since  both  of  these  expressive  means  can  be  replaced 
with  complex  role  inclusion  axioms  as  follows:  Sym (7?)  is 
equivalent  to  R~  E  R  and  Tra(7?)  is  equivalent  to  It  7?  E  R. 

Thus,  as  far  as  expressivity  is  concerned,  we  can  assume, 
for  convenience,  that  no  role  assertions  of  the  form  Traf  It) 
or  Sym  (7?)  appear  in  7 2a,  but  that  transitive  and/or  symmet¬ 
ric  roles  will  be  handled  by  the  RIAs  alone.  In  particular, 
notice  that  regularity  of  a  role  hierarchy  is  preserved  when 
replacing  such  role  assertions  with  the  corresponding  RIAs. 

The  situation  is  different,  however,  for  the  other  Rbox  as¬ 
sertions.  None  of  reflexivity,  irreflexivity,  antisymmetry  or 
disjointness  of  roles  can  be  enforced  by  role  inclusion  ax¬ 
ioms.  However,  as  we  shall  see  later,  reflexivity  and  irreflex¬ 
ivity  of  roles  are  closely  related  to  the  new  concept  37?. Self. 

Note  that  the  version  of  antisymmetry  introduced  above  is 
strict  in  the  sense  that  it  also  implies  irreflexivity  as  opposed 
to  the  more  widely  used  notion  of  antisymmetry  which  al¬ 
lows  for  reflexive  points.  For  instance,  in  mereology,  the 
relation  PartOf  is  usually  assumed  to  be  ‘reflexive  antisym¬ 
metric’  (i.e.,  reflexivity,  plus  xRy  and  yRx  implies  x  -  y), 
while  the  relation  properPartOf  is  assumed  to  be  ‘irreflex- 
ive  antisymmetric’  (defined  just  as  antisymmetry  above)  (Si¬ 
mons,  1987;  Casati  &  Varzi,  1999).  The  more  general  ver- 

2This  is  not  a  serious  restriction  for,  if  72  contains  E  cycles, 
we  can  simply  choose  one  role  R  from  each  cycle  and  replace  all 
other  roles  in  this  cycle  with  R  in  the  input  Rbox.  Tbox,  and  Abox. 


sion  of  antisymmetry  is  more  difficult  to  handle  algorithmi¬ 
cally,  and  we  leave  this  to  future  work. 

In  SHIQ  (and  SHOXQ),  the  application  of  qualified 
number  restrictions  has  to  be  restricted  to  certain  roles, 
called  simple  roles ,  to  preserve  decidability  (Horrocks,  Sat- 
tler,  &  Tobies,  1999).  In  the  context  of  STZOXQ ,  the  defi¬ 
nition  of  simple  role  has  to  be  slightly  modified,  and  simple 
roles  figure  not  only  in  qualified  number  restrictions,  but  in 
several  other  constructs  as  well.  Intuitively,  non-simple  roles 
are  those  that  are  implied  by  the  composition  of  roles. 

Given  a  role  hierarchy  7 2/,  and  a  set  of  role  assertions  7 Za 
(without  transitivity  or  symmetry  assertions),  the  set  of  roles 
that  are  simple  in  72  =  7 2/,  U  72a  is  inductively  defined  as 
follows: 

•  a  role  name  is  simple  if  it  does  not  occur  on  the  right  hand 
side  of  a  RIA  in  7 2/,, 

•  an  inverse  role  R~  is  simple  if  R  is,  and 

•  if  7?  occurs  on  the  right  hand  side  of  a  RIA  in  72./,  ,  then  R 
is  simple  if,  for  each  w  E  R  £  72./, ,  w  =  S  for  a  simple 
role  S. 

A  set  of  role  assertions  72a  is  called  simple  if  all  roles  7?,  S 
appearing  in  role  assertions  of  the  form  lrr(7?),  Asy(7?),  or 
Dis(7?,  S),  are  simple  in  72.  If  72  is  clear  from  the  context, 
we  often  use  “simple”  instead  of  “simple  in  72”. 

Definition  4  (Role  Box)  A  STZOXQ-role  box  (Rbox  for 
short)  is  a  set  72  =  72/,  U  720,  where  72/,  is  a  regular  role 
hierarchy  and  72a  is  a  finite,  simple  set  of  role  assertions. 

An  interpretation  satisfies  a  role  box  72  (written  X  |=  72) 
ifX  |=  Rh  and  X  |=  fifor  all  role  assertions  <f>  £  7?a.  Such 
an  interpretation  is  called  a  model  ofTZ. 

Concepts  and  Inference  Problems  for  SIZOIQ 

Definition  5  (STZOXQ  Concepts,  Tboxes,  and  Aboxes) 

The  set  of  SIZOXQ-concepts  is  the  smallest  set  such  that 

•  every  concept  name  (including  nominals)  and  T are 
concepts,  and, 

•  if  C,  D  are  concepts,  R  is  a  role  (possibly  inverse),  S  is 

a  simple  role  (possibly  inverse),  and  n  is  a  non-negative 
integer,  then  C  3  79,  C  U  79,  —iC,  MR.C,  37?. C,  3«S'.Self, 
(: ’fi-nS.C ),  and  nS.C )  are  also  concepts. 

A  general  concept  inclusion  axiom  ( GCI)  is  an  expres¬ 
sion  of  the  form  C  E  79  for  two  SIZOXQ-concepts  C  and 
79.  A  Tbox  X  is  a  finite  set  ofGCIs. 

An  individual  assertion  is  of  one  of  the  following  forms: 
a  :  C,  (a,  b)  :  7?,  (a,  b)  :  -<R,  or  a  b,  for  a,  b  £  I  (the 
set  of  individual  names),  a  (possibly  inverse)  role  R,  and  a 
STZOX Q-concept  C.  A  STZOXQ-Abox  A  is  a  finite  set  of 
individual  assertions. 

It  is  part  of  future  work  to  determine  which  of  the  restric¬ 
tions  to  simple  roles  in  role  assertions  Dis(7?,  S),  Asy(7?), 
and  lrr(7?),  as  well  as  the  concept  expression  3S'.Self,  are 
strictly  necessary  in  order  to  preserve  decidability  or  practi¬ 
cability.  These  restrictions,  however,  allow  a  rather  smooth 
integration  of  the  new  constructs  into  existing  algorithms. 


Definition  6  (Semantics  and  Inference  Problems)  Given 
an  interpretation  X  =  (Ax,  -x),  concepts  C,  D,  roles  R, 
S,  and  non-negative  integers  n,  the  extension  of  complex 
concepts  is  defined  inductively  by  the  following  equa¬ 
tions,  where  jj M  denotes  the  cardinality  of  a  set  M,  and 
concept  names,  roles,  and  nominals  are  interpreted  as  in 
Definition  1: 

Tx  =  Ax,  J_x  =  0,  (^C)x  =  Ax  \  C1 

(C  n  D)x  =  cxn  D1,  (■ C  U  D)1  =  C1  U  Dx 

(3 R.C)1  =  {x  |  3 y.{x,  y)  £  R1  and  y  £  C x} 
(37?.Self)x  =  {x  j  (x,  x)  £  R1} 

(V7?.C)X  =  (x  j  Vy.(x,  y)  £  R1  implies  y  £  Cx} 
nR.C)1  =  {x  j  j) {y.(x,  y)  £  R1  and  y  £  C1}  ^  n} 
nR.C )x  =  {x  j  t){j/.(x,  j/)  £  Rx  and  y  £  Cx}  ^  n} 

X  ;,v  a  model  of  a  Tbox  T  (written  X  |=  T)  if  C1  C  Dx 
for  each  GCI C  t  D  in  X.  A  concept  C  is  called  satisfiable 
if  there  is  an  interpretation  X  with  C'x  ^  0.  A  concept  D 
subsumes  a  concept  C  (written  C  C  D)  if  C1  C  D1  holds 
for  each  interpretation.  For  an  interpretation  X,  an  element 
x  £  Ax  is  called  an  instance  of  a  concept  C  if  x  £  Cx. 

X  satisfies  (is  a  model  of)  an  Abox  A  (X  \=  A)  if  for  all 
individual  assertions  <f>  £  A  we  have  X  \=  (f>,  where 

X  \=  a:  C  if  a1  £  Cx; 

X  |=  a  b  if  ax  f  b1-, 

X  \=  ( a,b):R  if  (a1 ,  b1)  £  Rx; 

X\=  (a,b):-<R  if  (ax ,bx)  ^  Rx . 

An  Abox  A  is  consistent  with  respect  to  an  Rbox  TZ  and  a 
Tbox  X  if  there  is  a  model  X  for  IZ  and  X  such  thatX  \=  A. 

The  above  inference  problems  can  be  defined  w.r.t.  a  role 
box  IZ  and/or  a  Tbox  X  in  the  usual  way,  i.e.,  by  replacing 
interpretation  with  model  of  IZ  and/or  X . 

Reduction  of  Inference  Problems 

For  DLs  that  are  closed  under  negation,  subsumption  and 
(un)satisfiability  of  concepts  can  be  mutually  reduced:  C  C 
D  iff  C  n  —1)  is  unsatisfiable,  and  C  is  unsatisfiable  iff  C  C 
_L.  Furthermore,  a  concept  C  is  satisfiable  iff  the  Abox  {a  : 
C}  (a  a  ‘new’  individual  name)  is  consistent. 

It  is  straightforward  to  extend  these  reductions  to  Rboxes 
and  Tboxes.  In  contrast,  the  reduction  of  inference  prob¬ 
lems  w.r.t.  a  Tbox  to  pure  concept  inference  problems  (pos¬ 
sibly  w.r.t.  a  role  hierarchy),  deserves  special  care:  in  Baader 
(1991);  Schild  (1991);  Baader  et  al.  (1993),  the  internalisa¬ 
tion  of  GCIs  is  introduced,  a  technique  that  realises  exactly 
this  reduction.  For  STZOXQ,  this  technique  only  needs  to 
be  slightly  modified.  We  will  show  in  a  series  of  steps  that, 
in  STZOXQ ,  satisfiability  of  a  concept  C  with  respect  to  a 
triple  (A,TZ,X)  of,  respectively,  a  STZOXQ  Abox,  Rbox, 
and  Tbox,  can  be  reduced  to  concept  satisfiability  of  a  con¬ 
cept  C'  with  respect  to  an  Rbox  TZ! ,  where  the  Rbox  7 Z! 
only  contains  role  assertions  of  the  form  Dis(7?,  S),  Ref  (R), 
or  Asy (R),  and  the  universal  role  U  does  not  appear  in  C' . 

While  nominals  can  be  used  to  ‘internalise’  the  Abox,  in 
order  to  eliminate  the  universal  role,  we  use  a  ‘simulated’ 
universal  role  U',  i.e.,  a  reflexive,  symmetric,  and  transi¬ 
tive  super-role  of  all  roles  and  their  inverses  appearing  in 


(A,TZ,X),  and  which,  additionally,  connects  all  nominals 
appearing  in  the  input. 

Thus,  let  C  and  ( A,TZ,X )  be  a  STZOXQ  concept  and 
Abox,  Rbox,  and  Tbox,  respectively.  In  a  first  step,  we  re¬ 
place  the  Abox  A  with  an  Abox  A'  such  that  A'  only  con¬ 
tains  individual  assertions  of  the  form  a:C.  To  this  purpose, 
we  associate  with  every  individual  a  £  I  appearing  in  A  a 
new  nominal  oa  not  appearing  in  X  or  C.  Next,  A!  is  the  re¬ 
sult  of  replacing  every  individual  assertion  in  A  of  the  form 
(a,  b)  :  R  with  a  :  3 R.Ob,  every  (a,  b) :  -i R  with  a  :  \/R.~>Ob, 
and  every  a  b  with  a  :  —•ot,.  Now,  given  C  and  A! ,  define 
C  as  follows: 

C':=C  FI  I”!  3U.(oa  n  D), 

a:DGA' 

where  U  is  the  universal  role.  It  should  be  clear  that  C  is 
satisfiable  with  respect  to  (A,  TZ ,  X)  if  and  only  if  C  is  sat¬ 
isfiable  with  respect  to  (TZ,  X). 

Lemma  7  (Abox  Elimination)  STZOXQ  concept  satisfia¬ 
bility  with  respect  to  Aboxes,  Rboxes,  and  Tboxes  is  poly- 
nomially  reducible  to  STZOXQ  concept  satisfiability  with 
respect  to  Rboxes  and  Tboxes  only. 

Hence,  in  the  following  we  will  assume  that  Aboxes  have 
been  eliminated.  Next,  although  we  have  the  ‘real’  universal 
role  U  present  in  the  language,  the  following  lemma  shows 
how  general  concept  inclusion  axioms  can  be  internalised 
while  at  the  same  time  eliminating  occurrences  of  the  uni¬ 
versal  role  U,  using  a  simulated  “universal”  role  U' ,  that  is, 
a  transitive  super-role  of  all  roles  (except  U )  occurring  in  X 
or  TZ  and  their  respective  inverses.  Recall  that  the  universal 
role  U  is  not  allowed  to  appear  in  Rboxes. 


Lemma  8  (Tbox  and  Universal  Role  Elimination)  Let  C, 

D  be  concepts,  X  a  Tbox,  and  TZ  =  TZh  U  TZa  an  Rbox.  Let 
Ur  ^  U  be  a  role  that  does  not  occur  in  C,  D,  X,  or  TZ,  and, 
for  X  a  Tbox  or  a  concept,  let  X'  result  from  X  by  replacing 
every  occurrence  ofU  with  U1.  We  define 


CT'  :=  VU'.(  n  iC'uB')n(  n  3U'.o), 

C'tD'eT'  VN3oGTUCUD  ' 


TZ\l  :=  TZh  U  {R  QU'  \  R  occurs  in  C',  D' ,  X' ,  or  TZ}, 
7 Z^  :=  TZa  U  {Tra((7'),  Sym([/'),  Ref(U')},  and 
7 Zu>  '■=  7 Z\[  U  TZU,  .  Then 


•  C  is  satisfiable  w.r.t.  X  and  TZ  iff  C'  n  Gt  is  satisfiable 
w.r.t.  7 Zu>. 

•  D  subsumes  C  with  respect  to  X  and  TZ  iff  C'  rHZTnCV' 
is  unsatisfiable  w.r.t.  7 Zjj>. 


The  proof  of  Lemma  8  is  similar  to  the  ones  that  can  be 
found  in  Schild  (1991)  and  Baader  (1991).  Most  impor¬ 
tantly,  it  must  be  shown  that  (a):  if  a  STZOXQ-concept  C 
is  satisfiable  with  respect  to  a  Tbox  X  and  an  Rbox  TZ,  then 
C,  X ,  TZ  have  a  nominal  connected  model,  i.e.,  a  model 
which  is  a  union  of  connected  components,  where  each  such 
component  contains  a  nominal,  and  where  any  two  elements 


of  a  connected  component  are  connected  by  a  role  path  over 
those  roles  occurring  in  C,  T  or  72,  and  (b):  if  y  is  reachable 
from  x  via  a  role  path  (possibly  involving  inverse  roles),  then 
( x ,  y)  £  U'X .  These  are  easy  consequences  of  the  semantics 
and  the  definition  of  U'  and  Ct>,  which  guarantees  that  all 
nominals  are  connected  by  U’  links. 

Now,  note  also  that,  instead  of  having  a  role  assertion 
lrr(i?)  £  lZa,  we  can  add,  equivalently,  the  GCI  T  C 
—i3.f2.Self  to  T,  which  can  in  turn  be  internalised.  Likewise, 
instead  of  asserting  Ref(  R),  we  can,  equivalently,  add  the 
GCI  T  t  3f?.Self  to  T.  However,  in  the  case  of  Ref(f?) 
this  replacement  is  only  admissible  for  simple  roles  R  and 
thus  not  possible  (syntactically)  in  general. 

Thus,  using  these  equivalences  (including  the  replace¬ 
ment  of  Rbox  assertions  of  the  form  Symf/7)  and  Tra(  R)) 
and  Lemmas  7  and  8,  we  arrive  at  the  following  theorem: 

Theorem  9  (Reduction) 

1.  Satisfiability  and  subsumption  of  SlZOl  Q-concepts  w.r.t. 
Tboxes,  Aboxes,  and  Rboxes,  are  polynomially  reducible 
to  (un)satisfiability  of  SlZOl Q-concepts  w.r.t.  Rboxes. 

2.  W.l.o.g.,  we  can  assume  that  Rboxes  do  not  contain  role 
assertions  of  the  form  I rr(f2),  Tra(72),  or  Sym (R),  and 
that  the  universal  role  is  not  used. 

With  Theorem  9,  all  standard  inference  problems  for 
SlZOl Q-concepts  and  Aboxes  can  be  reduced  to  the  prob¬ 
lem  of  determining  the  consistency  of  a  SlZOl Q-concept 
w.r.t.  to  an  Rbox  (both  not  containing  the  universal  role), 
where  we  can  assume  w.l.o.g.  that  all  role  assertions  in  the 
Rbox  are  of  the  form  Ref(72),  Asy(72),  or  Dis(72,  S) — we 
call  such  an  Rbox  reduced. 

SKOIQ  is  Decidable 

In  this  section,  we  show  that  SlZOl Q  is  decidable.  We 
present  a  tableau-based  algorithm  that  decides  the  consis¬ 
tency  of  a  SlZOl Q  concept  w.r.t.  a  reduced  Rbox,  and 
therefore  also  all  standard  inference  problems  as  discussed 
above,  see  Theorem  9.  Therefore,  in  the  following,  by  Rbox 
we  always  mean  reduced  Rbox. 

The  algorithm  tries  to  construct,  for  a  SlZOl Q-concept 
C  and  an  Rbox  72,  a  tableau  for  C  and  72,  that  is,  an  abstrac¬ 
tion  of  a  model  of  C  and  1Z. 

For  a  regular  role  hierarchy  72/,  and  a  (possibly  inverse) 
role  S  occurring  in  72/,  ,  a  non-deterministic  finite  automaton 
(NFA)  Bs  is  defined.  The  construction  of  these  automata  is 
identical  to  the  one  presented  in  Horrocks  &  Sattler  (2004), 
and  we  therefore  refer  to  this  paper  for  detailed  definitions 
and  proofs  of  the  automata  related  results  below. 

The  following  proposition  states  that  Bs  indeed  captures 
all  implications  between  (paths  of)  roles  and  S  that  are  con¬ 
sequences  of  the  role  hierarchy  72/,,  where  L(Bs)  denotes 
the  language  (a  set  of  strings  of  roles)  accepted  by  By. 

Proposition  10  1  is  a  model  oflZh  if  and  only  if,  for  each 
(possibly  inverse )  role  S  occurring  in  IZh,  each  word  w  £ 
L(Bs),  and  each  ( x ,  y)  £  w1,  we  have  ( x ,  y)  £  S1. 


Unfortunately,  as  shown  in  Horrocks  &  Sattler  (2004),  the 
size  of  Bn can  be  exponential  in  the  size  of  72.  Horrocks  & 
Sattler  (2004)  consider  certain  further  syntactic  restrictions 
of  role  hierarchies  (there  called  simple  role  hierarchies)  that 
avoid  this  exponential  blow-up.  We  conjecture  that,  without 
some  such  further  restriction,  this  blow-up  is  unavoidable. 
The  following  technical  Lemma  from  Horrocks  &  Sattler 
(2004)  will  be  needed  later  on. 

Lemma  11 

1.  S  £  L(Bs )  and,  if  w  C  S  £  1Z,  then  w  £  L(Bs). 

2.  If  S  is  a  simple  role,  then  L(Bs)  =  {R  \  R  jj|5}. 

3-  L(B\m(S))  =  {lnv(w)  |  w  £  L(BS)}. 

A  Tableau  for  SlZOl  Q 

In  the  following,  if  not  stated  otherwise,  C,  D  (possibly 
with  subscripts)  denote  <S72CZ<2-concepts  (not  using  the 
universal  role),  R,  S  (possibly  with  subscripts)  roles,  72  = 
TZh  U  lZa  an  Rbox,  and  Rr  the  set  of  roles  occurring  in  C 
and  72  together  with  their  inverses.  Furthermore,  as  noted  in 
Theorem  9,  we  can  (and  will  from  now  on)  assume  w.l.o.g. 
that  all  role  assertions  appearing  in  lZa  are  of  the  form 
Dis(72,  S),  Asy(72),  or  Ref(72). 

We  start  by  defining  fclos(Co,  72),  the  closure  of  a  concept 
Co  w.r.t.  a  regular  role  hierarchy  72.  Intuitively,  this  contains 
all  relevant  sub-concepts  of  Co  together  with  universal  value 
restrictions  over  sets  of  role  paths  described  by  an  NFA.  We 
use  NFAs  in  universal  value  restrictions  to  memorise  the 
path  between  an  object  that  has  to  satisfy  a  value  restriction 
and  other  objects.  To  do  this,  we  “push”  this  NFA-value  re¬ 
striction  along  all  paths  while  the  NFA  gets  “updated"  with 
the  path  taken  so  far.  For  this  “update”,  we  use  the  following 
definition. 

Definition  12  For  B  an  NFA  and  q  a  state  of  B,  B(q)  de¬ 
notes  the  NFA  obtained  from  B  by  making  q  the  (only)  initial 

C, i 

state  of  B,  and  we  use  q  — >  q'  £  B  to  denote  that  B  has  a 
transition  q  —>  q'. 

Without  loss  of  generality,  we  assume  all  concepts  to  be 
in  negation  normal  form  (NNF),  that  is,  negation  occurs 
only  in  front  of  concept  names  or  in  front  of  372. Self.  Any 
<S72CXQ-conccpt  can  easily  be  transformed  into  an  equiv¬ 
alent  one  in  NNF  by  pushing  negations  inwards  using  a 
combination  of  De  Morgan’s  laws  and  equivalences  such  as 
-.(3ii.cj  =  (VR.-.C),  nR.C )  =  (>(n+  1)R.C),  etc. 

We  use  AC  for  the  NNF  of  ->C.  Obviously,  the  length  of 
AC  is  linear  in  the  length  of  C. 

For  a  concept  Co,  clos(Co)  is  the  smallest  set  that  contains 
Co  and  that  is  closed  under  sub-concepts  and  A.  The  set 
fclos(Co,72)  is  then  defined  as  follows: 

fclos(C0,72)  :=  clos(C0)  U  {fBs(q)-D  \ 

\/S.D  £  clos(Co)  and  Bs  has  a  state  q}. 

It  is  not  hard  to  show  and  well-known  that  the  size  of 
clos(Co)  is  linear  in  the  size  of  Co-  For  the  size  of 
fclos(Co,  72),  we  have  mentioned  above  that,  for  a  role  S, 


the  size  of  Bs  can  be  exponential  in  the  depth  of  TZ.  Since 
there  are  at  most  linearly  many  concepts  \/S.D,  this  yields  a 
bound  for  the  cardinality  of  fclos(C'o,  TZ)  that  is  exponential 
in  the  depth  of  TZ  and  linear  in  the  size  of  Co- 

Definition  13  (Tableau)  T  =  (S,  L,  £)  is  a  tableau  for  Co 
w.r.t.  7 Z  if 

•  S  is  a  non-empty  set; 

•  £  :  S  ->  2*dos(Co'n)  maps  each  element  in  S  to  a  set  of 
concepts; 

•  E  :  Rc0  — +  2s  xS  maps  each  role  to  a  set  of  pairs  of 
elements  in  S; 

•  Co  G  L(s)  for  some  s£  S. 

Furthermore,  for  all  s,t  G  S,  C,  C\ ,  C2  €  fclos(Co,  71), 
o  €  N  fl  fclos(Co,7£),  R,Sg  RCo,  and 

ST(s,  C)  :={r  £  S  |  ( s,r )  £  £{S)  and  C  £  <C(r)}, 

the  tableau  T  satisfies: 

(PI)  C£L(s)  =>  ~^C  fE(s), 

(C  atomic  or  37?. Self); 

(P2)  T  G  L(s),  and  ±  f  L{s); 

(P3)  37?. Self  £  L(s)  =►  ( s,s )  £  £(R); 

(P4)  — <3i?.Self  £  L(s)  =>  (s,s)  f  £(7?); 

(P5)  Ci  n  C2  £  £(s)  = =►  Ci,C2  G  E(s); 

(P6)  CiUC2GC(s)  =>  Ci  G  L(s)orC2  £  £,(«); 

(P7)  VB(p).C  £  E(s),  ( s,t )  £  £(S), 

and p  —>  q  £  B(p)  =4-  VB(q).C  G  L(i); 

(P8)  WB.C  £  L(s)  and  e  £  L(B)  =>  CgC(s); 

(P9)  V5.C  G  £(s)  =►  WBS.C  £  E(s); 

(P10)  SSIC  G  L(s)  =4-  there  is  some  r  £  S  with 
( s,r )  £  E(S)  and  C  £  L(r); 

(Pll)  (s,t)  £  E(R)  4=4-  (t,  s)  £  £(lnv(7?)); 

(P12)  (s,t)  £  £(R)  andRSS  =4-  (s,t)  £  £(£>); 

(P13)  {^nS.C)  £  C(s)  =►  #ST(s,C)  ^  n; 

(P14)  ffinS.C)  £  H{s)  = =►  #5T(s,C)  ^  n; 

(P15)  nS.C )  G  £(s)  and  ( s,t )  £  £(S)  =4> 

CGC(f)  or  GCGC(f); 

(P16)  Dis(i?,  S)  £  TZa  =4-  £(i?)  n  £(S)  =  0; 

(P17)  Ref(i?)G-Ka  =►  (s,  s)  £  £(R); 

(P18)  Asy (i?)  £TZa  =4>  \s,t)  £  E(R) 
implies  ( t,s )  E(R); 

(P19)  o  £  L(r)  for  some  r  £  S; 

(P20)  o  £  C(s)  n  H(t)  =4>  s  =  t. 

Theorem  14  (Tableau)  A  SIZOlQ-concept  Co  is  satisfi- 
able  w.r.t.  a  reduced  Rbox  7 Z  iff  there  exists  a  tableau  for 
Co  w.r.t.  1Z. 

Proof:  For  the  if  direction,  let  T  =  (S.  £  ,  £)  he  a  tableau 
for  Co  w.r.t.  1Z.  We  extend  the  relational  structure  of  T  and 
then  prove  that  this  indeed  gives  a  model. 

More  precisely,  a  model  X  =  (A1,  -x)  of  Co  and  1Z  can 
be  defined  as  follows:  we  set  A1  :=  S,  Cx  :=  {s  |  C  G 
£(s)}  for  concept  names  C  in  fclos(Co, 7Z),  where  (P19) 


and  ( P20)  guarantee  that  nominals  are  indeed  interpreted  as 
singleton  sets,  and,  for  roles  names  R  £  Rev  we  set 

Rx  :=  {(so>  sn)  G  (A1)2  |  exists  si, . . . ,  sn_  1  with 
{si,  Si+i)  G  £(S'i+i)  and  S\  ■  ■  ■  Sn  £  L(Br)} 

The  semantics  of  complex  concepts  is  given  through  the  def¬ 
inition  of  the  S'fZOX Q-semantics.  Due  to  Lemma  11.3  and 
(Pll),  the  semantics  of  inverse  roles  can  either  be  given  di¬ 
rectly  as  for  role  names,  or  by  setting  ( R~)x  :=  {(y,x)  \ 
(x,y)  £  Rxj.  Moreover,  we  have,  by  definition  of  X, 
Lemma  11.2,  (Pll),  and  (P12)  that,  for  T  a  simple  role, 
Tx  =  £(T). 

We  have  to  show  that  X  is  a  model  of  1Z  and  Co-  We  begin 
by  showing  that  X  \=  1Z.  Since  7 Z  is  reduced,  we  only  have 
to  deal  with  role  assertions  of  the  form  Dis(i?,  S),  Ref(7?), 
and  Asy  (R). 

Consider  an  assertion  Dis (R,  S )  G  7 Z.  By  definition  of 
S'R.OX Q-Rboxes,  both  R  and  S  are  simple  roles,  and  thus 
Rx  =  E(R)  and  Sx  =  E(S).  Moreover,  (P16)  implies 
E(R)  n£(£>)  =  0,  and  thus  RxnSx  =  0.  Next,  if  Ref  (R)  G 
TZa,  (P17)  and  R  £  L(Bb)  (Lemma  11.1)  imply  Diag1  C 
Rx.  Finally,  if  Asy (R)  G  TZa  then  Rx  =  E(R)  since  R 
is  simple,  and  so  (s,f)  G  Rx  implies  ( s,t )  G  E(R)  and 
so  (t,s)  fi  E(R)  by  (P18),  whence  (t,  s)  f  Rx .  Thus  X 
satisfies  each  role  assertion  in  7Za. 

Next,  we  have  to  show  that  X  |=  TZh-  Due  to  Proposi¬ 
tion  10,  it  suffices  to  prove  that,  for  each  (possibly  inverse) 
role  S,  each  word  w  £  L(Bs),  and  each  ( x,y )  £  wx,  we 
have  (x,  y)  £  Sx .  The  proof  of  this  is  identical  to  the  case 
of  7 ZXQ  and  can  be  found  in  Horrocks  &  Sattler  (2004). 

Secondly,  we  prove  that  X  is  a  model  of  Co-  We  show 
that  C  G  L(s)  implies  s  £  Cx  for  each  s  £  S  and  each 
C  £  fclos(34. 7 Z).  This  proof  can  be  given  by  induction  on 
the  length  of  concepts,  where  we  count  neither  negation  nor 
integers  in  number  restrictions.  The  only  interesting  cases 
are  C  =  (^ nS.E ),  C  =  \/S.E,  and  C  =  (— i)3i?.Self  (for 
the  other  cases,  see  Horrocks,  Sattler,  &  Tobies  (2000);  Hor¬ 
rocks  &  Sattler  (2002)): 

•  If  (?fnS.E)  £  L(s),  then  (P13)  implies  that 

fiST(s,E)  <  n.  Moreover,  since  S  is  simple, 
Lemma  11.2  implies  that  L(Bs)  =  {*S"  |  S'  US'}, 
and  (P12)  implies  that  Sx  =  E(S).  Hence  (P15)  im¬ 
plies  that,  for  all  t,  if  ( s,t )  £  Sx,  then  E  £  L(f)  or 
AE  G  L(f).  By  induction  Ex  =  {t  \  E  £  L(f)},  and 
thus  s  £  (^ nS.E)1 . 

•  Let  \/S.E  £  L(s)  and  (s,t)  £  Sx .  From  (P9)  we 
have  that  VBs-E  £  L(s).  By  definition  of  Sx,  there  are 
S\ . . .  Sn  G  L(Bs)  and  s,;  with  s  =  so,  t  =  sn,  and 
(si-i,Si)  £  £(Si).  Applying  (P7)  n  times,  this  yields 
\/Bs(q)-E  £  H(t)  for  q  a  final  state  of  Bs-  Thus  (P8) 
implies  that  E  £  L(f).  By  induction,  t  £  Ex,  and  thus 
s  £  ( \/S.E)x . 

•  Let  37?. Self  G  L(s).  Then,  by  (P3),  (s,s)  £  £(7?)  and, 
since  R  £  L(Bjf)  and  by  definition  of  X,  we  have  ( s ,  s )  G 
Rx.  It  follows  that  s  £  (37?.Self)x. 


•  Let  — i37?.Self  G  L(s).  Then,  by  (P4),  ( s,s )  (f  £(£?). 
Since  R  is  a  simple  role,  R1  =  E(R).  Hence  ( s ,  s)  (£  Rx, 
and  sos  £  (->3i?.Self)x. 

For  the  converse,  suppose  X  —  (A1,  -x)  is  a  model  of  Co 
w.r.t.  TZ.  We  define  a  tableau  T  =  (S,  L,  £)  for  Co  and  TZ  as 
follows: 

S  :=  Ax; 

E(R)  :=  R1-  and 

£(s)  :=  {C  G  fclos(C0,ft)  |  s  G  Cx} 

U  {VBS.C  |  VS'.C  G  fclos(C0,ft)  and  s  G  (VS.C)X| 

U  {\ iBR{q).C  G  fclos(C0,  ft)  |  &  •  •  •  S„  G  L(B*(g))  =► 
s  G  (V5i.V52.  •  •  -VS„.C)X,£  G  L(BR(q))  =>  a  G  C1} 

We  have  to  show  that  T  satisfies  (P1)-(P20),  and  restrict 
our  attention  to  the  only  new  cases. 

For  (P9),  if  VS.C  G  L(s),  then  s  G  (VS.C)X  and  thus 
WBs-C  G  L(s)  by  definition  of  T. 

For  (P7),  let  VB(p).C  G  L(s)  and  (s,  £)  G  £  (S').  Assume 

that  there  is  a  transition  p  — >  gin  B(p)  and  VS(g).C  ^  £(£). 
By  definition  of  T,  this  can  have  two  reasons: 

1.  there  is  a  word  S2  ■  ■  ■  Sn  G  L(B(q))  and 

£  ^  (VS2.  •  ■  •  VSn.C)x.  However,  this  implies 

that  SS2  . . .  S„  G  L(B(p))  and  thus  that  we  have 
s  G  (VS.VS2. . . .  VSn.C)x,  which  contradicts,  together 
with  ( s ,  £)  G  Sx,  the  definition  of  the  semantics  of 
STZOXQ  concepts. 

2.  e  G  L(B(q ))  and  £  ^  Cx.  This  implies  that  S  G  L(B(p)) 
and  thus  contradicts  s  G  (VS.C)X. 

For  (P8),  £  G  L{B(jp))  implies  s  G  Cx  by  definition  of 
T,  and  thus  C  G  L(s). 

Finally,  (P16)-(P20)  follow  immediately  from  the  defi¬ 
nition  of  the  semantics. 

□ 

The  Tableau  Algorithm 

In  this  section,  we  present  a  terminating,  sound,  and 
complete  tableau  algorithm  that  decides  consistency  of 
SftOXQ-concepts  not  using  the  universal  role  w.r.t.  reduced 
Rboxes,  and  thus,  using  Theorem  9,  also  concept  satisfiabil¬ 
ity  w.r.t.  Rboxes,  Tboxes  and  Aboxes. 

We  first  define  the  underlying  data  structures  and  corre¬ 
sponding  operations.  For  more  detailed  explanations  con¬ 
cerning  the  intuitions  underlying  these  definitions,  consult 
Horrocks  &  Sattler  (2005). 

The  algorithm  generates  a  completion  graph ,  a  structure 
that,  if  complete  and  clash-free,  can  be  unravelled  to  a  (pos¬ 
sibly  infinite)  tableau  for  the  input  concept  and  Rbox.  More¬ 
over,  it  is  shown  that  the  algorithm  returns  a  complete  and 
clash-free  completion  graph  for  Co  and  TZ  if  and  only  if  there 
exists  a  tableau  for  Co  and  TZ ,  and  thus  with  Lemma  14,  if 
and  only  if  the  concept  Co  is  satisfiable  w.r.t.  TZ. 

As  usual,  in  the  presence  of  transitive  roles,  blocking  is 
employed  to  ensure  termination  of  the  algorithm  (Horrocks, 
Sattler,  &  Tobies,  2000). 


Definition  15  (Completion  Graph)  Let  TZ  be  a  reduced 
Rbox,  let  Co  be  a  STZOX Q-concept  in  NNF  not  using  the 
universal  role,  and  let  N  be  the  set  of  nominals.  A  com¬ 
pletion  graph  for  Co  with  respect  to  TZ  is  a  directed  graph 
G  =  (H,  E,SL,fa)  where  each  node  x  £  V  is  labelled  with 
a  set 

lZ(x)  C  fclos(Co,ft)  U  N  U  {(^mft.C)  | 

(^nft.C)  G  fclos(Co,  TZ)  and  m  <  n} 

and  each  edge  (x,  y)  G  E  is  labelled  with  a  set  of  role 
names  E((x,y))  containing  (possibly  inverse )  roles  occur¬ 
ring  in  Co  or  TZ.  Additionally,  we  keep  track  of  inequalities 
between  nodes  of  the  graph  with  a  symmetric  binary  relation 
between  the  nodes  ofG. 

In  the  following,  we  often  use  R  G  £((&,  y))  as  an  abbre¬ 
viation  for  (x,  y)  G  E  and  R  £  £>((&,  y)). 

If  (x,  y)  G  E,  then  y  is  called  a  successor  of  x  and  x  is 
called  a  predecessor  of  y.  Ancestor  is  the  transitive  closure 
of  predecessor,  and  descendant  is  the  transitive  closure  of 
successor.  A  node  y  is  called  an  R-successor  of  a  node  x  if, 
for  some  R'  with  R'  E  R,  R'  G  £((&,  y)).  A  node  y  is  called 
a  neighbour  ( R-neighbour)  of  a  node  x  if  y  is  a  successor 
(R-successor)  of  x  or  if  x  is  a  successor  (\nv(R)-successor) 

ofy. 

For  a  role  S  and  a  node  x  in  G,  we  define  the  set  of  x’s 
S-neighbours  with  C  in  their  label,  SG(x,  C),  as  follows: 

SG(x,  C)  :=  {y  |  y  is  an  S -neighbour  of  x  and  C  £  £>(y)}. 

G  is  said  to  contain  a  clash  if  there  are  nodes  x  and  y 
such  that 

1.  1  G  L(x),  or 

2.  for  some  concept  name  A,  {A,  —1 A }  C  E(x),  or 

3.  x  is  an  S-neighbour  ofx  and  ^BS'.Self  G  £(x),  or 

4.  there  is  some  Dis(i?,  S)  G  TZa  and  y  is  an  R-  and  an 
S-neighbour  ofx,  or 

5.  there  is  some  Asy  (R)  £  TZa  and  y  is  an  R-neighbour  ofx 
and  x  is  an  R-neighbour  ofy,  or 

6.  there  is  some  concept  nS.C )  G  £(x)  and 
{yo,  ■  ■  ■ ,  Vn}  C  SG(x,  C)  with  yt  Zfi  yj  far  all  0  <  i  < 
j  <  n,  or 

7.  for  some  o  G  N,  1  /  1/  and  o  G  £(x)  D  £>(y). 

If  01 , . . . ,  Of  are  all  the  nominals  occurring  in  Co  then  the 
tableau  algorithm  is  initialised  with  the  completion  graph 
G  =  ({ro,TT...,r*},0, £,0)  with  £(r0)  =  {C0}  and 
^(ri)  =  {°i}  f°r  1  <  *  <  L  G  is  then  expanded  by 
repeatedly  applying  the  expansion  rules  given  in  Figure  1, 
stopping  if  a  clash  occurs. 

Before  describing  the  tableau  algorithm  in  more  detail,  we 
define  some  terms  and  operations  used  in  the  (application  of 
the)  expansion  rules: 

Nominal  Nodes  and  Blockable  Nodes  A  node  a;  is  a 
nominal  node  if  L(x)  contains  a  nominal.  A  node  that  is 
not  a  nominal  node  is  a  blockable  node.  A  nominal  oGN 
is  said  to  be  new  in  G  if  no  node  in  G  has  o  in  its  label. 


Blocking  A  node  x  is  label  blocked  if  it  has  ancestors  x' , 
y  and  y'  such  that 

1 .  a;  is  a  successor  of  x'  and  y  is  a  successor  of  y', 

2.  y,  x  and  all  nodes  on  the  path  from  y  to  x  are  blockable, 

3.  £(x)  =  &(y)  and  L(x')  =  £(t/),  and 

4.  £((x',x))  =  L((y',y)). 

In  this  case,  we  say  that  y  blocks  x.  A  node  is  blocked  if 
either  it  is  label  blocked  or  it  is  blockable  and  its  predeces¬ 
sor  is  blocked;  if  the  predecessor  of  a  blockable  node  x  is 
blocked,  then  we  say  that  x  is  indirectly  blocked. 
Generating  and  Shrinking  Rules  and  Safe  Neighbours 
The  3-  and  AW -rules  are  called  generating  rules,  and 
the  and  the  o-rule  are  called  shrinking  rules.  An  R- 
neighbour  y  of  a  node  x  is  safe  if  (i)  x  is  blockable  or  if  (ii) 
a;  is  a  nominal  node  and  y  is  not  blocked. 

Pruning  When  a  node  y  is  merged  into  a  node  x,  we 
“prune”  the  completion  graph  by  removing  y  and,  recur¬ 
sively,  all  blockable  successors  of  y.  More  precisely,  prun¬ 
ing  a  node  y  (written  Prune(j/))  in  G  =  (V.  E,  L,  yields 
a  graph  that  is  obtained  from  G  as  follows; 

1.  for  all  successors  2  of  y,  remove  (y,  z)  from  E  and,  if  2  is 
blockable.  Pruned); 

2.  remove  y  from  V. 

Merging  In  some  rules,  we  “merge”  one  node  into  another 
node.  Intuitively,  when  we  merge  a  node  y  into  a  node  x,  we 
add  £>(y)  to  £(x),  “move”  all  the  edges  leading  to  y  so  that 
they  lead  to  x  and  “move”  all  the  edges  leading  from  y  to 
nominal  nodes  so  that  they  lead  from  x  to  the  same  nominal 
nodes;  we  then  remove  y  (and  blockable  sub-trees  below  y) 
from  the  completion  graph.  More  precisely,  merging  a  node 
y  into  a  node  x  (written  Merge(y,  x))  in  G  =  (V,  E,L,^) 
yields  a  graph  that  is  obtained  from  G  as  follows; 

1.  for  all  nodes  2  such  that  (2,  y)  £  E 

(a)  if  {(x,  2),  (2,  x)}  HE  =  0,  then  add  (2,  x)  to  E  and  set 
£«2,x))  =  L((z,y)), 

(b)  if  (z,x)  £  E,  then  set  E((z,x))  =  fZ((z,x))  U 

£((z.y))> 

(c)  if  (x,z)  €  E,  then  set  £((x,  2))  =  L((x,z))  U 
{Inv(S)  |  S  £  £((2,2/))},  and 

(d)  remove  (2,  y)  from  E\ 

2.  for  all  nominal  nodes  2  such  that  {y,  z)  £  E 

(a)  if  {(x,  2),  (2,  x)}  HE  =  0,  then  add  (x,  2)  to  E  and  set 
£((x,2))  =  fl({y,z)), 

(b)  if  (x,z)  £  E,  then  set  £((x,  2))  =  fi((x,z))  U 

£((j/.4)> 

(c)  if  (z,x)  £  E,  then  set  Ju({z,x))  =  L({z,x))  U 
{Inv(S')  |  S  £  Z((y,z))},  and 

(d)  remove  (y,  z)  from  E\ 

3.  set  £(x)  =  £(x)  U  E(y); 

4.  add  1/2  for  all  2  such  that  y  -=f  2;  and 

5.  Prune(y). 

If  y  was  merged  into  x,  we  call  x  a  direct  heir  of  y,  and  we 
use  being  an  heir  of  another  node  for  the  transitive  closure 
of  being  a  “direct  heir”. 


Level  (of  Nominal  Nodes)  Let  01 , . . . ,  on  be  all  the  nom- 
inals  occurring  in  the  input  concept  1).  We  define  the  level 
of  a  node  inductively  as  follows: 

•  each  (nominal)  node  x  with  an  Oi  £  £(x),  1  <  i  <  £,  is 
of  level  0,  and 

•  a  nominal  node  x  is  of  level  i  if  x  is  not  of  some  level 
j  <  i  and  x  has  a  neighbour  that  is  of  level  i  —  1. 

Strategy  (of  Rule  Application)  The  expansion  rules  in 
Figure  1  are  applied  according  to  the  following  strategy: 

1 .  the  o-rule  is  applied  with  highest  priority, 

2.  next,  the  and  the  AW -rule  are  applied,  and  they  are  ap¬ 
plied  first  to  nominal  nodes  with  lower  levels  (before  they 
are  applied  to  nodes  with  higher  levels).  In  case  they  are 
both  applicable  to  the  same  node,  the  AW -rule  is  applied 
first. 

3.  all  other  rules  are  applied  with  a  lower  priority. 

We  are  now  ready  to  finish  the  description  of  the  tableau 
algorithm.  A  completion  graph  is  complete  if  it  contains  a 
clash,  or  when  none  of  the  rules  is  applicable.  If  the  expan¬ 
sion  rules  can  be  applied  to  Go  and  TZ  in  such  a  way  that 
they  yield  a  complete,  clash-free  completion  graph,  then  the 
algorithm  returns  “Go  is  satisfiable  w.r.t.  1Z”,  and  “Go  is  un- 
satisfiable  w.r.t.  1Z”  otherwise. 

Termination,  Soundness,  and  Completeness 

All  but  the  Self-Ref-rule  have  been  used  before  for  frag¬ 
ments  of  S1ZOTQ ,  see  Horrocks,  Sattler,  &  Tobies  (1999); 
Horrocks  &  Sattler  (2002,  2004),  and  the  three  Vj-rules  are 
the  obvious  counterparts  to  the  tableau  conditions  (P7)- 
(P9). 

As  usual,  we  prove  termination,  soundness,  and  complete¬ 
ness  of  the  tableau  algorithm  to  show  that  it  indeed  decides 
satisfiability  of  SIZOIQ- concepts  w.r.t.  Rboxes. 

Theorem  16  (Termination,  Soundness,  and  Completeness) 

Let  Go  be  a  STZOXQ-concept  in  NNF  and  TZ  a  reduced 
Rbox. 

1.  The  tableau  algorithm  terminates  when  started  with  Go 
and  TZ. 

2.  The  expansion  rules  can  be  applied  to  Go  and  JZ  such  that 
they  yield  a  complete  and  clash-free  completion  graph  if 
and  only  if  there  is  a  tableau  for  Go  w.r.t.  TZ. 

Proof:  (1):  The  algorithm  constructs  a  graph  that  consists  of 
a  set  of  arbitrarily  interconnected  nominal  nodes,  and  “trees” 
of  blockable  nodes  with  each  tree  rooted  in  ro  or  in  a  nomi¬ 
nal  node,  and  where  branches  of  these  trees  might  end  in  an 
edge  leading  to  a  nominal  node. 

Termination  is  a  consequence  of  the  usual  SH1Q  condi¬ 
tions  with  respect  to  the  blockable  tree  parts  of  the  graph, 
plus  the  fact  that  there  is  a  bound  on  the  number  of  new 
nominal  nodes  that  can  be  added  to  G  by  the  AW -rule. 

The  termination  proof  for  the  STZOXQ  tableaux  is  virtu¬ 
ally  identical  to  the  one  for  STIOIQ,  whence  we  omit  the 


n-rule: 

if  Gi  n  C2  £  T(x),  x  is  not  indirectly  blocked, 
and  {Ci,  £2}  ££(*), 
then  L(x)  — >  £(x)  U  {CT,  C2} 

U-mle: 

if  Ci  13  C2  £  T(x),  x  is  not  indirectly  blocked, 
and{Ci,C2}  fi£(x)  =  0 

then  L(x)  — >  E(x)  U  { E }  for  some  E  £  {Ci,  C2} 

3 -rule: 

if  3S.C  £  £(x),  x  is  not  blocked,  and 
x  has  no  /(-neighbour  y  with  C  £  L  ( y ) 
then  create  a  new  node  y  with 

£«x,2/»  :=  {S'}  and  £ (y)  :=  {C} 

Self-Ref -rule:  if  BS.Self  £  L{x)  or  Ref  (5)  £  TZa, 
x  is  not  blocked,  and  S  (f:  £({x,x)) 
then  add  an  edge  (x,  x)  if  it  does  not  yet  exist,  and 
set  L((x,  x))  — >  &((x,  x))  U  {/(} 

Vi-rule: 

if  VS.C  £  £(x),  x  is  not  indirectly  blocked, 
and  VHS.C  ^  £(x) 
then  £ (x)  — »  L(x)  U  {VZ3S.C} 

V2-rule: 

if  \/B(p).C  £  £j(x),  x  is  not  indirectly  blocked, 

s 

p  —>  q  in  B(p),  and  there  is  an  /(-neighbour 
y  of  x  with  \/B(q).C  £  &(y), 
then£(y)  ^tL(y)U{VH(q).C} 

V3-rule: 

if  \/B.C  £  £(x),  x  is  not  indirectly  blocked, 
e  £  L{B)  and  C  £L  &(x) 
then  fi(x)  — >  £(x)  U  {C} 

choose-rule: 

if  (^ nS.C )  £  E(x),  x  is  not  indirectly  blocked, 
and  there  is  an  /(-neighbour  y  of  x 
with  {C,£C}  D£ (y)  =  0 
then  £(y)  — ♦  &(y)  U  {/(}  for  some  E  £  {C,  £C} 

T-rule: 

if  1.  (^ nS.C )  £  E(x),  x  is  not  blocked 

2.  there  are  not  n  safe  /(-neighbours 
yi , . . . ,  yn  of  x  with  C  £  L(yi) 
and  yi  ^  j/j  for  1  <  i  <  j  <  n 
then  create  n  new  nodes  yi, ...  ,yn  with 
£((x,yi))  =  {S},L(yt)  =  {C}, 
and  yi  yj  for  1  <  i  <  j  <  n. 

^-rule: 

if  1.  (^ nS.C )  £  E(z),  z  is  not  indirectly  blocked 

2.  |J SG(z,  C)  >  n  and  there  are  two  S'-neighbours 
x,  y  of  a  with  C  £  H(x)  fl  £(y),  and  not  x  =/=■  y 
then  1.  if  x  is  a  nominal  node  then  Merge(y,  x) 

2.  else,  if  y  is  a  nominal  node  or  an 
ancestor  of  x  then  Merge(x,  y) 

3.  else  Merge(j/,  x) 

o-rule: 

if  for  some  0  £  AT  there  are  2  nodes  x,  y 
with  0  £  L  (x)  fl  £  (y)  and  not  x  ^  y 
then  Merge(x,  y) 

AW-rule: 

if  1.  nS.C )  £  -C(x),  *  is  a  nominal  node,  and 

there  is  a  blockable  /(-neighbour  y  of  x  such 
that  C  £  L  (y)  and  x  is  a  successor  of  y, 

2.  there  is  no  m  such  that  1  ^  m  ^  n, 

mS.C )  £  Cj(x),  and  there  exist  m  nominal 
/(-neighbours  zi, ...  ,zm  of  x  with  C  £  L(zi) 
and  Zi  Zj  for  all  1  <  i  <  j  <  m. 
then  1.  guess  m  with  1  ^  m  ^  n, 

and  set  £(x)  —  £(x)  U  {(^mS'.C)} 

2.  create  m  new  nodes  y  1, ... ,  ym  with 
L((x,yi))  =  {S},L(yi)  =  {C,Oi}, 
for  each  Oi  £  Ni  new  in  G, 
and  yi  ^  yj  for  1  <  i  <  j  <  m. 

Figure  1:  The  Expansion  Rules  for  the  S1ZOIQ  Tableau 
Algorithm. 


details  and  refer  the  reader  to  Horrocks  &  Sattler  (2005). 
To  see  this,  note  first  that  the  blocking  technique  employed 
for  S1ZOTQ  is  identical  to  the  one  for  SHOIQ.  Next, 
the  closure  fclos(Co, TV)  is  defined  differently,  comprising 
concepts  of  the  form  VBs{q)-C,  generally  yielding  a  size  of 
fclos(Co,  TZ)  that  can  be  exponential  in  the  depth  of  the  role 
hierarchy.  However,  the  construction  of  the  automata  can 
also  be  considered  a  pre-processing  step  and  part  of  the  in¬ 
put,  in  that  case  keeping  the  polynomial  bound  on  the  size 
of  the  closure  relative  to  the  input.  Furthermore,  it  should  be 
clear  that  the  new  Self-  Ref-rule  (only  adding  new  reflexive 
edges)  as  well  as  the  new  clash  conditions  do  not  affect  the 
termination  of  the  algorithm. 

(2):  For  the  “if”  direction,  we  can  obtain  a  tableau  T  = 
(S,  £/.  £)  from  a  complete  and  clash-free  completion  graph 
G  by  unravelling  blockable  “tree”  parts  of  the  graph  as  usual 
(these  are  the  only  parts  where  blocking  can  apply). 

More  precisely,  paths  are  defined  as  follows.  For  a  label 
blocked  node  x,  let  b(x)  denote  a  node  that  blocks  x. 

A  path  is  a  sequence  of  pairs  of  blockable  nodes  of 
G  of  the  form  p  =  (( xo ,  x'0 ), . . . ,  ( xn ,  x'n)).  For  such  a 
path,  we  define  Tail(p)  :=  xn  and  Tail^p)  :=  x'n.  With 
(p\(xn+±,  x'n+l))  we  denote  the  path 

((^0,  X0),  •  •  •  ,  *£71)3  ('t'ro+l?  %n+l))  • 

The  set  Paths(G)  is  defined  inductively  as  follows: 

•  For  each  blockable  node  x  of  G  that  is  a  successor  of  a 
nominal  node  or  a  root  node,  (( x ,  x))  £  Paths(G),  and 

•  For  a  path  p  £  Paths(G)  and  a  blockable  node  y  in  G: 

-  if  y  is  a  successor  of  Tail(p)  and  y  is  not  blocked,  then 
(p\(y,y))  e  Paths(G),  and 

-  if  y  is  a  successor  of  Tail(p)  and  y  is  blocked,  then 

(p\(b(y),y))  £  Paths(G). 

Please  note  that,  due  to  the  construction  of  Paths,  all  nodes 
occurring  in  a  path  are  blockable  and,  for  p  £  Paths(G) 
with  p  =  (p'\(x,x')),  x  is  not  blocked,  x '  is  blocked  iff 
x  /  x1,  and  x'  is  never  indirectly  blocked.  Furthermore,  the 
blocking  condition  implies  F(x)  =  F(x'). 

Next,  we  use  Nom(G)  for  the  set  of  nominal  nodes  in  G, 
and  define  a  tableau  T  =  (S,  £/,  £)  from  G  as  follows. 

S  =  Nom(G)  U  Paths(G) 

r,(n- £(Tail(p))  if P  £  Paths(G) 

*  y  L(p)  ifpeNom(G) 

£(/?)  =  Ei  U  E2  U  E3  U  E±,  where 
Ex  =  {( P,q )  e  Paths(G)  x  Paths(G)  | 

q  =  ( p\(x,x '))  and  x'  is  an  /(-successor  of  Tail (p),  or 
p  =  (r?| (at, at/))  and  x'  is  an  lnv(f?)-successor  of  Tail(g)} 
E2  =  {(p,x)  £  Paths(G)  x  Nom(G)  | 

x  is  an  /(-neighbour  of  Tail(p)} 
E3  =  {(x,p)  £  Nom(G)  x  Paths(G)  | 

Tail(p)  is  an  /(-neighbour  of  x} 
E4  =  {(x,y)  £  Nom(G)  x  l\lom(G)  | 

y  is  an  /(-neighbour  of  x} 


We  already  commented  above  on  S,  and  £/  is  straightfor¬ 
ward.  Unfortunately,  £  is  slightly  cumbersome  because  we 
must  distinguish  between  blockable  and  nominal  nodes. 
Claim:  T  is  a  tableau  for  Cq  with  respect  to  7Z. 

Firstly,  by  definition  of  the  algorithm,  there  is  an  heir  Xq 
of  ro  with  Co  G  £(xo).  By  the  ^-rule,  xq  is  either  a  root 
node  or  a  nominal  node,  and  thus  cannot  be  blocked.  Hence 
there  is  some  s  £  S  with  Co  £  £/(s).  Next,  we  prove  that 
T  satisfies  each  (Pi). 

•  (PI),  (P2),  (P5)  and  (P6)  are  ttivially  implied  by  the  def¬ 
inition  of  £/  and  completeness  of  G. 

•  (P3)  and  (P17)  follow  from  the  consttuction  of  £  and 
completeness  of  G,  and  (P4)  follows  from  clash-freeness. 

•  for  (P7),  consider  a  tuple  (s,  t)  £  £(//)  with  VB(p).C  £ 

£j'(s)  and  p  — >  q  £  B(p).  We  have  to  show  that 
\/B(q).C  £  and  distinguish  four  different  cases: 

-  if  (s,t)  £  Paths(G)  x  Paths(G),  then  VB(p).C  £ 
£(Tail(s))  and 

*  either  Tail^f)  is  an  //-successor  of  Tail(s).  Hence 
completeness  implies  \/B(q).C  £  £(Tail '(t)),  and  by 
dehnition  of  Paths(G),  either  Tail/(i)  =  Tail(f),  or 
Tail(f)  blocks  Tailr(t)  and  the  blocking  condition  im¬ 
plies  ^(Tail^f))  =  £(Tail(£)). 

*  or  Tail'(s)  is  an  lnv(//)-successor  of  Tail(£).  Again, 
either  Tail/(s)  =  Tail(s),  or  Tail(s)  blocks  Tail^s) 
in  which  case  the  blocking  condition  implies  that 
VB(p).C  £  C(Tail'(s)),  and  thus  completeness  im¬ 
plies  that  VB(q).C  £  £(Tail(f)). 

-  if  (s,  t)  £  Nom(G)  x  Nom(G),  then  \/B(p).C  £  L(s) 
and  t  is  an  //-neighbour  of  s.  Hence  completeness  im¬ 
plies  MB{q).C  £  £(£). 

-  if  (s,t)  £  Nom(G)  x  Paths(G),  then MB(p).C  £  C(s) 

and  Tail(f)  is  an  //-neighbour  of  s.  Hence  complete¬ 
ness  implies  VB(q).C  £  £(Tail(f)). 

-  if  ( s,t )  £  Paths(G)  x  Nom(G),  then  \/B(p).C  £ 
£(Tail(s))  and  /  is  an  //-neighbour  of  Tail(s).  Hence 
completeness  implies  VB(q).C  £  U(f). 

In  all  four  cases,  by  dehnition  of  £/,  we  have  \/B(q).C  £ 

£'(/). 

•  (P8)  and  (P9)  follow  from  completeness  of  G. 

•  for  (P10),  consider  some  s  G  S  with  3/Z.C  £  £/(s). 

-If  s  £  Paths(G),  then  3R.C  £  £(Tail(s)),  Tail(s)  is 
not  blocked,  and  completeness  of  T  implies  the  exis¬ 
tence  of  an  //-neighbour  y  of  Tail(s)  with  C  £  U(y). 

*  If  y  is  a  nominal  node,  then  y  £  S,  C  £  £/(y),  and 

( s,y )  £  £(//). 

*  If  y  is  blockable  and  a  successor  of  Tail(s),  then 
{s\(y,y))  G  S,  for  y  =  y  or  y  =  b(y),  C  £ 

//)))>  and  (s,(s\(y,y)))  £  £(//). 

*  If  y  is  blockable  and  a  predecessor  of  Tail(s),  then 

s  =  (p\{y,  2/) I  (Tail(s) ,  Tail'(a))),  C  £  &'((p\(y,y))), 
and 

{s,{p\{y,y)))  g  £(//). 

-  If  s  £  Nom(G),  then  completeness  implies  the  exis¬ 
tence  of  some  //-successor  x  of  s  with  C  £  L(x). 


*  If  x  is  a  nominal  node,  then  ( s,x )  £  £(//)  and  C  £ 

L'(x). 

*  If  x  is  a  blockable  node,  then  x  is  a  safe  //-neighbour 
of  s  and  thus  not  blocked.  Hence  there  is  a  path  p  £ 
Paths(G)  with  Tail(p)  =  x,  ( s,p )  G  £(//)  and  C  G 
£'(P). 

•  (Pll)  and  (P12)  are  immediate  consequences  of  the  def¬ 
inition  of  “//-successor”  and  “//-neighbour”,  as  well  as 
the  dehnition  of  £. 

•  for  (P13),  consider  some  s  £  S  with  (^ nR.C )  G  U'(s). 
Clash-freeness  implies  the  existence  of  at  most  n  R- 
neighbours  yi  of  s  with  C  £  £(f/i).  By  construction,  each 
t  £  S  with  ( s,t )  G  £(//)  corresponds  to  an  //-neighbour 
yi  of  s  or  Tail(s),  and  none  of  these  //-neighbours  gives 
rise  to  more  than  one  such  yi.  Moreover,  since  £/(£)  = 
£(f/j),  (P13)  is  satished. 

•  for  (P14),  consider  some  s  £  S  with  (^ nR.C )  G  C'(s). 

-  if  s  £  Nom(G),  then  completeness  implies  the  exis¬ 
tence  of  n  safe  //-neighbours  y\ , . . . ,  yn  of  s  with  and 
yj  ^  y:j ,  for  each  i  ^  j,  and  C  £  L(yi),  for  each 
1  <  i  <  n.  By  construction,  each  y,  corresponds  to  a 
tt  £  S  with  fj  ^  tj,  for  each  i  7/  j: 

*  if  yi  is  blockable,  then  it  cannot  be  blocked  since  it 
is  a  safe  //-neighbour  of  s.  Hence  there  is  a  path 
(p\(yi,yi))  G  S  and  (s,  {p\{yu  Vi)))  G  £(//). 

*  if  yi  is  a  nominal  node,  then  (s,  y,;)  £  £(//). 

-  if  s  £  Paths(G),  then  completeness  implies  the  ex¬ 
istence  of  n  //.-neighbours  yi, ...  ,yn  of  Tail (s)  with 
yj  ^  yj,  for  each  i  ^  j,  and  C  £  L(yi),  for  each 
1  <  1  <  n.  By  construction,  each  y,  corresponds  to  a 
ti  £  S  with  ti  ^  tj,  for  each  i  ^  j: 

*  if  yi  is  safe,  then  it  can  be  blocked  if  it  is  a  succes¬ 
sor  of  Tail(s).  In  this  case,  the  “pair”  construction 
in  our  dehnition  of  paths  ensure  that,  even  if  6(y,  )  = 
b(yj),  for  some  i  ^  j,  we  still  have  {p\(b(yi),  y^)  ^ 
(p\(b(yj),bj)). 

*  if  yi  is  unsafe,  then  ( s ,  y.j)  £  £(//). 

Hence  all  /,  are  different  and,  by  construction,  C  £ 
£j'(ti),  for  each  1  <  i  <  n. 

•  (P15)  is  satished  due  to  completeness  of  G  and  the  fact 
that  each  t  £  S  with  ( s,t )  £  £(//)  corresponds  to  an  R- 
neighbour  of  s  (in  case  s  £  Nom(G))  or  of  Tail(s)  (in 
case  s  £  Paths(G)). 

•  (P16)  and  (P18)  follow  from  clash-freeness  and  dehni¬ 
tion  of  £,  (P19)  follows  ttivially  from  the  initialisation  of 
G,  and  (P20)  is  due  to  completeness  of  G  and  the  fact 
that  nominal  nodes  are  not  “unravelled”. 

For  the  “only  if"  direction,  given  a  tableau  T  =  (S,  £) 

for  Co  w.r.t.  7 Z,  we  can  apply  the  non-deterministic  rules, 
i.e.,  the  U-,  choose-,  ^-,  and  AW -rule,  in  such  a  way  that  we 
obtain  a  complete  and  clash-free  graph:  inductively  with  the 
generation  of  new  nodes,  we  define  a  mapping  tt  from  nodes 
in  the  completion  graph  to  individuals  in  S  of  the  tableau  in 
such  a  way  that. 


1.  for  each  node  x,  L(x)  C  L'(7t(x)), 

2.  for  each  pair  of  nodes  x,  y  and  each  role  R,  if  y  is  an  R- 
successorof  x,  then  (tr(x),  n(y)}  £  L(R),  and 

3.  x  y  implies  tt(x)  ^  7 r(y). 

This  is  analogous  to  the  proof  in  Horrocks,  Sattler,  & 
Tobies  (1999)  with  the  additional  observation  that,  due  to 
(P20),  application  of  the  o-rule  does  not  lead  to  a  clash  of 
the  form  (7)  as  given  in  Definition  15.  Similarly,  an  appli¬ 
cation  of  the  Self-  Ref-rule  does  not  lead  to  a  clash  of  the 
form  (3)  due  to  Condition  (P4),  a  clash  of  the  form  (4)  can 
not  occur  due  to  (P16),  and  a  clash  of  the  form  (5)  can  not 
occur  due  to  (P18). 

□ 

From  Theorems  9,  14  and  16,  we  thus  arrive  at  the  fol¬ 
lowing  theorem: 

Theorem  17  (Decidability)  The  tableau  algorithm  decides 
satisfiability  and  subsumption  ofSIZOTQ-concepts  with  re¬ 
spect  to  Aboxes,  Rboxes,  and  Tboxes. 

Outlook  and  Future  Work 

We  introduced  a  description  logic,  S1Z01Q ,  that  overcomes 
certain  shortcomings  in  expressiveness  of  other  DLs.  We 
have  used  STiOIQ  and  1ZTQ  as  a  starting  point,  extended 
them  with  “useful-yet-harmless”  expressive  means,  and  ex¬ 
tended  the  tableau  algorithm  accordingly.  SIZOTQ  is  in¬ 
tended  to  be  a  basis  for  future  extensions  of  OWL,  and  has 
already  been  adopted  as  the  logical  basis  of  OWL  1.1. 

It  is  left  for  future  work  to  determine  whether  the  restric¬ 
tions  to  simple  roles  can  be  relaxed,  to  pinpoint  the  exact 
computational  complexity  of  S1Z01Q ,  and  to  include  fur¬ 
ther  role  assertions  such  as  the  more  general  version  of  an¬ 
tisymmetry  to  allow  a  better  modeling  of  mereological  no¬ 
tions  (Goodwin,  2005). 

A  further  line  of  investigation  concerns  concrete  datatypes 
with  inverse  functional  datatype  properties:  these  are  of  in¬ 
terest  since  they  allow  to  express  simple  key  constraints.  For 
instance,  we  might  want  to  use  a  datatype  property  SSN  for 
social  security  number  as  a  key  for  US  citizen. 
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Appendix  M: 

FaCT++  source  code  as  of  21  April  2006.  Enclosed  with  the  full  electronic  version  of  this  report 
as  the  file  FaCT++src.tar. 
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